Methods and systems for assessing infertility and related pathologies

ABSTRACT

Methods for assessing infertility and related pathologies and informing treatment type and timing thereof are provided. According to certain embodiments, methods of the invention include determining levels of one or more transcripts present in a sample obtained from a subject suspected of having endometriosis, identifying transcript levels that correspond to a regulation pattern specific to a time-point in a uterine cycle, and characterizing endometriosis of the subject based upon the identified transcript levels. The invention includes methods for assessing age-associated increase in aneuploidy rates based on FSH levels and IVF success rates based on obesity in PCOS patients.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No. 14/802,609, filed Jul. 17, 2015, which claims the benefit of U.S. Provisional Application No. 62/025,802, filed Jul. 17, 2014 and U.S. Provisional Application No. 62/065,416, filed Oct. 17, 2014, each of which is incorporated by reference in its entirety.

BACKGROUND

According to the Centers for Disease Control and Prevention, 6.7 million women (around 10.9%) in the United States between the ages of 15 and 44 suffer from impaired fecundity, or the ability to become pregnant and carry a baby to term. See Chandra A, Copen C E, Stephen E H. Infertility and impaired fecundity in the United States, 1982-2010: Data from the National Survey of Family Growth. National health statistics reports; no 67. Hyattsville, Md.: National Center for Health Statistics, 2013. A variety of factors such as endometriosis, high rates of aneuploidy embryos, and polycystic ovary syndrome (PCOS) can contribute to impaired fecundity and understanding these causes on a case-by-case basis can help inform treatment decisions.

Endometriosis affects 10% to 15% of reproductive-age women. Symptoms of endometriosis may include infertility, chronic pelvic pain, irregular uterine bleeding, dysmenorrhea, and/or dyspareunia. Endometriosis is characterized by the abnormal growth of endometrial tissue, which normally lines the inside of one's uterus, on the outside of one's uterus. The displaced endometrial tissue may spread to one's ovaries, bowels, or pelvic tissue, and, in some cases, continues to act like normal intrauterine endometrial tissue during one's uterine cycle—by thickening, breaking down, and bleeding. The uterine cycle is regulated by hormones and has three major phases: the menstrual phase, the proliferative phase, and the secretory phase. The secretory phase is often further broken down into the early secretory stage, mid-secretory phase, and late secretory phase. Symptoms and severity vary by case along with the need for fertility treatment and the likelihood of success thereof.

The cause of endometriosis is unclear. The most widely-accepted explanation of endometriosis is retrograde menstruation. Retrograde menstruation occurs when menstrual blood containing endometrial cells flows back through the fallopian tubes and into the pelvic cavity, as opposed to flowing out the body. The endometrial cells present in the back flow are believed to stick to the pelvic walls and surfaces of the pelvic organs, where they continue to proliferate. Other proffered causes of endometriosis include embryonic cell growth, surgical scar implantation, endometrial cell transport, or an immune system disorder.

Expression studies for examining genes associated with endometriosis have provided further understanding of its etiology. For example, the expression studies have indicated that misregulation of a number of molecular pathways are associated with endometriosis. While expression studies offer insight as to what genes correlate with endometriosis, there has yet to be a consistent approach that allows one to characterize endometriosis or inform treatment of endometriosis based on expression levels and regulation patterns.

PCOS is a common endocrine system disorder with symptoms that may include irregular or no menstrual periods, heavy periods, excess body and facial hair, acne, pelvic pain, trouble getting pregnant, and patches of thick, darker, velvety skin. Impaired fecundity resulting from PCOS may be treated using a number of methods including diet adjustments, ovulation-inducing medications, surgical intervention, and assisted reproductive techniques such as in-vitro fertilization (IVF). For women with PCOS, like other disorders affecting fertility, success rates for these treatments vary on a case-by-case basis and are not generally predictable and understood.

Aneuploidy is the presence of an abnormal number of chromosomes in a cell. High aneuploidy rates are often associated with poor oocyte and embryo quality, both of which decrease with age and often lead to unviable embryos and, accordingly, impaired fecundity. While aneuploidy rates appears to increase with a woman's age, the association has not been well characterized and the ability to predict aneuploidy rates for a given individual would be useful in informing family planning and possible fertility treatment.

As noted, many cases of impaired fecundity are treatable, allowing a woman to become pregnant and carry a baby to term. Some methods, such as IVF, can be expensive and painful while not necessarily producing the desired outcome. Accordingly, providing an accurate picture of an individual patient's likelihood of success with a given treatment method and equipping the patient to maximize that likelihood is extremely important before undertaking a treatment regimen.

SUMMARY

The invention relates to methods and systems for assessing infertility and related pathologies, including endometriosis, PCOS, and high aneuploidy rates. The invention includes systems and methods for assessing endometriosis and informing course of treatment. Aspects of the invention include identifying genetic signatures of endometriosis that correlate to the various phases of a woman's uterine cycle. In certain embodiments, a woman's phase-specific endometriosis signatures are identified by comparing the patient's genomic expression data to reference phase-specific expression patterns associated with endometriosis. The phase-specific endometriosis signatures are utilized to provide accurate diagnostics (e.g. determine phase of a patient's uterine cycle or determine type/severity of the endometriosis), tailor treatment based on the phase-specific endometriosis signature, and/or tailor treatment to coincide with a phase of interest.

Systems and methods of the invention also relate to assessing risk of IVF failure in patients with PCOS. In general, methods include identifying obese patients suffering from PCOS through a measure such as body mass index (BMI) and predicting likelihood of implantation, clinical pregnancy, and/or live birth outcomes in IVF treatment. The invention includes systems and methods for assessing an individual's risk of producing an aneuploidy embryo based on factors including age and follicle-stimulating hormone (FSH) levels.

According to certain aspects, phase-specific genetic signatures for a patient are determined by identifying the patient's gene expression levels that correspond to a regulation pattern associated with a specific phase of the uterine cycle. The regulation pattern may be indicative of an endometriotic condition or a non-endometriotic condition. The regulation pattern specific to the uterine cycle may be obtained from a consensus data set that incorporates data from one or more sources, including a certain patient population, publications, studies, and data repositories (including protein-protein interactions and tissue expression patterns). In particular embodiments, the regulation pattern includes statistically-significant expression patterns associated with endometriosis obtained from the consensus data set. In certain embodiments, a meta-analysis is performed on the consensus data set to determine the regulation pattern. The meta-analysis may process and filter data based on a number a variables, such ectopic and/or eutopic tissue, the phases of the uterine cycle, particular patient populations, e.g. infertile/not infertile, positive/negative diagnosis for endometriosis, location of the ectopic tissue, pain and other endometriosis-associated symptoms.

In some embodiments, the invention provides methods for assessing endometriosis that include conducting a laboratory procedure to determining levels of transcripts present in a sample obtained from a patient who is suspected of having endometriosis, and identifying transcript levels that correspond to a regulation pattern specific to a time-point in the patient's uterine cycle. In some embodiments, the time-point of the regulation pattern is a phase of the uterine cycle. The identified transcript levels of the patient are then used to characterize endometriosis. The characterization may include determining the phase(s) of the subject's uterine cycle based on the identified transcripts. Additionally, the characterization may include determining the type/stage of the endometriosis based on the identified transcripts. In further embodiments, the method may further include determining the type of treatment for the endometriosis (e.g. a drug or therapeutic that targets the gene or the biochemical pathways associated with the gene) or timing of the treatment based on the characterization (e.g., during a certain phase of the uterine cycle).

Other aspects involve methods for targeting treatment of endometriosis. In certain embodiments, such methods include determining expression levels of one or more genes over different time-points during a subject's uterine cycle, identifying a time point during the uterine cycle in which expression levels are not synchronous or are dissimilar with respect to a non-endometriotic condition. For example, the subject may have differentially expressed genes at a certain phase—in circumstances where a subject's genes are regulation pattern (i.e. up-regulated/de-regulated) during the proliferative phase is different from the non-endometriodic regulation patterns at the proliferative phase. A course of treatment may then be indicated to coincide with the phases where the misregulation is indicated. In addition, a course of treatment may be indicated that based on the misregulation, e.g. a drug or therapeutic that targets the gene or the biochemical pathways associated with the gene.

Further embodiments involve determining genetic signatures of a patient across the various phases of the patient's uterine cycle in order to classify endometriosis. Such methods include determining expression levels of one or more transcripts in a sample obtained from a subject with endometriosis across different time-points of the subject's uterine cycle. The determined transcript levels are then compared to reference transcript levels corresponding to different time-points of the uterine cycle. The reference transcript level may be the consensus expression level of one or more transcripts obtained from a population of certain subjects. The subjects that make up the population for the reference level may be chosen based on certain phenotypic traits—e.g., positive for endometriosis, negative for endometriosis, infertile, fertile, certain age or weight, etc. Based on the comparison, differential transcripts at each time point of the uterine cycle are determined. The differential transcripts at each time point are considered the subject's genetic signature for the respective time points. The subject's genetic signature can then be used to classify endometriosis, e.g., determine the type/stage of the endometriosis, and to determine a course of treatment specific to the subject's genetic signatures.

Certain aspects of the invention include an array for assessing endometriosis. The array includes a substrate and a plurality of oligonucleotides attached to the substrate at discrete addressable positions. At least one of the oligonucleotides hybridizes to a portion of one of the following genes: CCL3L1, CCL3, FAM180A, THBS2, PDGFRL, FN1, CLE11A, CCNA2, KIF20A, BUB1B, HSD17B6, HSD11B1, C7, C3, CXCL2, CXCL12, CXCL13, PDGFC, CXCL14, ACTA2, TAGLN, and SORBS1.

In certain aspects, systems and methods of the invention relate to determining that a patient has a decreased probability of successful IVF treatment where the patient is diagnosed with PCOS and the patient's BMI is greater than or equal to a threshold level. In certain embodiments the threshold level may be 30 kg/m².

In certain aspects, methods of the invention relate to assessing future aneuploidy rates. The methods includes conducting a laboratory procedure to determine a follicle stimulating hormone (FSH) level in a sample obtained from an individual and matching the FSH level with the individual's age. The method also includes the steps of identifying a prospective risk of producing an aneuploidy embryo at a given age based upon said matching step.

In certain embodiments, the sample may include blood or urine obtained from the individual. The matching step may include comparing the FSH level to a threshold level. In various embodiments, where the FSH level is below the threshold level, prospective risk may be identified by taking an initial risk of producing an aneuploidy embryo and increasing that risk by about 10% for each year of the individual's age above puberty. In alternative methods, where the FSH level is above the threshold level, prospective risk may be identified by taking an initial risk of producing an aneuploidy embryo and increasing that risk by about 15% for each year of the individual's age above puberty. In certain embodiments, the threshold level may be about 13 mUI/mL. Methods may include preparing a written report recommending an accelerated course of treatment for the individual or preparing a written report recommending oocyte retrieval and cryopreservation. In certain embodiments, methods may include retrieving and cryopreserving an oocyte from the individual where the FSH level is greater than the threshold level.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 illustrates the thickness changes of the endometrial lining as the uterine cycle progresses from the proliferative stage to the late secretory stage.

FIG. 2 provides a schematic illustration of the meta-analysis process.

FIG. 3 illustrates retrograde menstruation.

FIG. 4 illustrates conditions associated with endometriosis.

FIG. 5 illustrates various parameters examined in the meta-analysis of data and associated with endometriosis.

FIG. 6A illustrates gene expression of the eutopic endometrium of the samples across the proliferative, early secretory, mid-secretory, and late secretory phases.

FIG. 6B illustrates gene expression of the ectopic endometrium of endometriosis positive samples across the proliferative, early secretory, mid-secretory, and late secretory phases.

FIG. 7 illustrates the phase-specific genetic signature differences between endometriosis and normal populations.

FIG. 8 illustrates up-regulated and de-regulated genes associated with endometriosis at the proliferative stage.

FIG. 9 illustrates up-regulated and de-regulated genes associated with endometriosis at the early secretory stage.

FIG. 10 illustrates up-regulated genes associated with endometriosis at the mid- to late secretory phase.

FIG. 11 illustrates the percentage of cohort for several diagnosis groups.

FIG. 12 illustrates methods for analyzing multi-cycle IVF data.

FIG. 13 illustrates the impact of obesity on oocyte retrieval, number of viable embryos, implantation rate, and live birth outcomes for several diagnosis groups.

FIG. 14 illustrates a system for performing methods of the invention.

DETAILED DESCRIPTION

The invention generally relates to methods and systems for assessing endometriosis in a subject and informing course of treatment. Aspects of the invention include identifying genomic signatures for endometriosis that correlate to the various phases of a woman's uterine cycle. The phase-specific endometriosis signatures are utilized to provide accurate diagnostics (e.g. determine phase of a patient's uterine cycle or determine type/severity of the endometriosis), tailor treatment based on the phase-specific endometriosis signature, and/or tailor treatment to coincide with a particular phase.

Methods of the invention relate to characterizing and informing treatment of endometriosis. Endometriosis is the abnormal proliferation of endometrial tissue outside of the uterine. The endometrial tissue outside of the uterine is often referred to as ectopic tissue; whereas the normal endometrial tissue that lines the inside of the uterine is referred to as eutopic tissue. In some instances, the ectopic endometrial tissue behaves in a similar manner as the eutopic tissue, i.e. thickening and bleeding over the course of the uterine (or menstrual) cycle. The menstrual fluid generated from the ectopic tissue, unlike the eutopic tissue, has no direct route of discharge. As a result, cysts often form at sites of endometriotic adhesion and the surrounding area may become chronically inflamed, which elicits cellular responses relating to immunity and tissue remodeling.

There are several different types/stages of endometriosis. The stage of the endometriosis is based on the location, amount, depth, and size of the ectopic tissue. Specific criteria include the extent and spread of the tissue, the involvement of pelvic structures in the disease, the extent of pelvic adhesions, and the blockage of fallopian tubes. Stage I (subtle stage) involves minimal ectopic tissue, i.e. subtle cyst-like growths from 1 to 3 mm. Stage II (typical stage) includes mild ectopic tissue, including cysts and fibrous growths that may span 1 to 2 cm. Stage III (cystic ovarian stage) involves large cysts ranging from 4-15 cm that cover ovaries. Stage IV (severe stage) involves wide-spread solid tumors covering a majority of the pelvic structures.

The uterine cycle governing endometrial tissue (both eutopic and ectopic) has several different phases. The different phases are characterized by hormone changes, and thus the phases vary from person to person. The uterine cycle begins with the menstrual or menstruation phase. The menstrual phase is the phase during which the endometrium is shed as menstrual flow. For eutopic tissue, the menstrual flow sheds out of the cervix and vagina, whereas the menstrual flow may not be discharged for ectopic tissue. The first day of menstrual flow is defined as the first day of the menstrual cycle. The menstrual phase lasts about 3 to 7 days. During the menstrual phase, the pituitary glands begin to secrete follicle-stimulating hormone (FSH). The rise in FSH triggers the proliferation phase (Follicular).

The proliferation phase is the part of the uterine cycle during which follicles inside the ovaries develop and mature in preparation for ovulation. The levels of FSH increase in the bloodstream during the proliferation phase, stimulating the maturation of follicles. The follicles each contain an egg, and usually only one will reach full growth and will be released at ovulation. Also during the proliferation phase, the ovaries produce estrogen, which causes endometrium tissue to thicken. Once estrogen levels peak, the pituitary glands slow the secretion of FSH in favor of secreting luteinizing hormone (LH). Increased levels of LH cause the mature follicle to rupture and release the egg. The released egg will travel to the fallopian tubes. The releasing of the egg is called ovulation, and it usually occurs about 14 days from the beginning of the next uterine cycle.

The end of ovulation marks the beginning of the secretory (Luteal) phase. During the secretory phase, LH and FSH decrease. The ruptured follicle closes after releasing the egg and forms a corpus luteum, which produces progesterone. Estrogen levels are high during the secretory phase, and progesterone and estrogen cause the lining of the uterus to thicken more in order to prepare for possible fertilization. If the egg is not fertilized, the corpus luteum degenerates, progesterone production stops and estrogen levels decrease. Eventually, the top layers of the endometrial lining break down and shed, starting a new uterine cycle. The progression of the secretary phase may further broken down in to early secretory, mid secretory, and late secretory. FIG. 1 illustrates the changing of the thickness of the endometrial lining as the uterine cycle progresses from the proliferative stage to the late secretory stage.

Aspects of the invention determine and analyze gene expression patterns during different time-points over the uterine cycle. In certain embodiments, the different time-points are the various phases of the uterine cycle. For example, expression levels of one or more genes may be determined during the menstrual phase, proliferation phase, or the secretory phase (early, mid or late).

Methods of the invention involve obtaining a sample, e.g. a tissue or body fluid, which is suspected to include an endometrial-associated gene or gene product. The sample may be collected in any clinically acceptable manner. A tissue is a mass of connected cells and/or extracellular matrix material, e.g. skin tissue, endometrial tissue, nasal passage tissue, CNS tissue, neural tissue, eye tissue, liver tissue, kidney tissue, placental tissue, mammary gland tissue, placental tissue, gastrointestinal tissue, musculoskeletal tissue, genitourinary tissue, bone marrow, and the like, derived from, for example, a human or other mammal and includes the connecting material and the liquid material in association with the cells and/or tissues. A body fluid is a liquid material derived from, for example, a human or other mammal. Such body fluids include, but are not limited to, mucous, blood, plasma, serum, serum derivatives, bile, blood, maternal blood, phlegm, saliva, sweat, amniotic fluid, menstrual fluid, mammary fluid, follicular fluid of the ovary, fallopian tube fluid, peritoneal fluid, urine, and cerebrospinal fluid (CSF), such as lumbar or ventricular CSF. A sample may also be a fine needle aspirate or biopsied tissue. A sample also may be media containing cells or biological material. In certain embodiments, infertility-associated genes or gene products may be found in reproductive cells or tissues, such as gametic cells, gonadal tissue, fertilized embryos, and placenta. In certain embodiments, the sample is drawn maternal blood or saliva.

In particular embodiments, the sample is obtained from endometrial tissue. The endometrial tissue may be eutopic (e.g. normal intrauterine endometrial tissue), or ectopic, (e.g., misplaced endometrial tissue). The endometrial tissue samples may be obtained over different time-points across the uterine cycle.

Laboratory procedures described below (e.g., determining expression levels using a microarray or nucleic acid extraction, enrichment, amplification, or sequencing) are performed on the sample to determine expression levels for one or more transcripts. Nucleic acid is extracted from the sample according to methods known in the art. See for example, Maniatis, et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., pp. 280-281, 1982, the contents of which are incorporated by reference herein in their entirety. In certain embodiments, a genomic sample is collected from a subject followed by enrichment for genetic regions or genetic fragments of interest, for example by hybridization to a nucleotide array comprising endometrial-related genes or gene fragments of interest. The sample may be enriched for genes of interest (e.g., endometrial-associated genes) using methods known in the art, such as hybrid capture. See for examples, Lapidus (U.S. Pat. No. 7,666,593), the content of which is incorporated by reference herein in its entirety.

RNA may be isolated from eukaryotic cells by procedures that involve lysis of the cells and denaturation of the proteins contained therein. Tissue of interest includes gametic cells, gonadal tissue, endometrial tissue, fertilized embryos, and placenta. RNA may be isolated from fluids of interest by procedures that involve denaturation of the proteins contained therein. Fluids of interest include blood, menstrual fluid, mammary fluid, follicular fluid of the ovary, peritoneal fluid, or culture medium. Additional steps may be employed to remove DNA. Cell lysis may be accomplished with a nonionic detergent, followed by microcentrifugation to remove the nuclei and hence the bulk of the cellular DNA. In one embodiment, RNA is extracted from cells of the various types of interest using guanidinium thiocyanate lysis followed by CsCl centrifugation to separate the RNA from DNA (Chirgwin et al., Biochemistry 18:5294-5299 (1979)). Poly(A)+RNA is selected by selection with oligo-dT cellulose (see Sambrook et al., MOLECULAR CLONING—A LABORATORY MANUAL (2ND ED.), Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1989). Alternatively, separation of RNA from DNA can be accomplished by organic extraction, for example, with hot phenol or phenol/chloroform/isoamyl alcohol. If desired, RNase inhibitors may be added to the lysis buffer. Likewise, for certain cell types, it may be desirable to add a protein denaturation/digestion step to the protocol.

For many applications, it is desirable to preferentially enrich mRNA with respect to other cellular RNAs, such as transfer RNA (tRNA) and ribosomal RNA (rRNA). Most mRNAs contain a poly(A) tail at their 3′ end. This allows them to be enriched by affinity chromatography, for example, using oligo(dT) or poly(U) coupled to a solid support, such as cellulose or SEPHADEX (see Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, vol. 2, Current Protocols Publishing, New York (1994). Once bound, poly(A)+mRNA is eluted from the affinity column using 2 mM EDTA/0.1% SDS.

According to certain embodiments, expression levels of a patient are compared to a reference data specific to phase in the uterine cycle. The reference data may comprise phase-specific endometriosis signatures (ectopic signatures) or phase-specific normal signatures (eutopics). The signatures may be determined by conducting a meta-analysis on one or more sources of expression data obtained from normal patients, endometriosis patients, or both. A meta-analysis suitable for use in accordance with the invention is described hereinafter. The phase-specific signatures are typically regulation pattern exhibited by either the healthy or the diseased tissue. Regulation patterns associated with endometriosis typically include up-regulated or de-regulated genes and the misregulation changes across the various phases of the uterine cycle. Up-regulation is a process that occurs within a cell triggered by a signal (originating internal or external to the cell), which results in increased expression of one or more genes and as a result the protein(s) encoded by those genes. Conversely, de-regulation is a process resulting in decreased gene and corresponding protein expression. In certain embodiments, the reference data may include a consensus expression levels associated with a particular patient population (e.g. endometriosis population or normal population).

The following is a list of genes whose expression levels correlate significantly with endometriosis: CCL3L1, CCL3, FAM180A, THBS2, PDGFRL, FN1, CLE11A, CCNA2, KIF20A, BUB1B, HSD17B6, HSD11B1, C7, C3, CXCL2, CXCL12, CXCL13, PDGFC, CXCL14, ACTA2, TAGLN, and SORBS1. As shown in FIG. 8, de-regulated genes associated with the proliferative phase include CCNA2, KIF20A, BUB1B. Up-regulated genes associated with the proliferative phase include HSD17B6, HSD11B1, C7, C3, CXCL2, CXCL12, CXCL14. As indicated in FIG. 9, de-regulated genes associated with the early secretory phase include CXCL13. Up-regulated genes associated with the early secretory phase include CCNA2, KIF20A, BUB1B. As shown in FIG. 10, up-regulated genes associated with the mid- to late include ACTA2, TAGLN, and SORBS1.

Phase-specific genes associated with endometriosis are also described in: Hawkins, Shannon M., et al. “Functional microRNA involved in endometriosis.” Molecular endocrinology 25.5 (2011): 821-832; Sha, G., et al. “Differentially expressed genes in human endometrial endothelial cells derived from eutopic endometrium of patients with endometriosis compared with those from patients without endometriosis.” Human reproduction 22.12 (2007): 3159-3169; Burney, Richard O., et al. “Gene expression analysis of endometrium reveals progesterone resistance and candidate susceptibility genes in women with endometriosis.” Endocrinology 148.8 (2007): 3814-3826; Crispi, Stefania, et al. “Transcriptional profiling of endometriosis tissues identifies genes related to organogenesis defects.” Journal of cellular physiology 228.9 (2013): 1927-1934; Eyster, Kathleen M., et al. “Whole genome deoxyribonucleic acid microarray analysis of gene expression in ectopic versus eutopic endometrium.” Fertility and sterility 88.6 (2007): 1505-1533; Hever, Aniko, et al. “Human endometriosis is associated with plasma cells and overexpression of B lymphocyte stimulator.” Proceedings of the National Academy of Sciences 104.30 (2007): 12451-12456; Hull, M. Louise, et al. “Endometrial-peritoneal interactions during endometriotic lesion establishment.” The American journal of pathology 173.3 (2008): 700-715; Talbi, S., et al. “Molecular phenotyping of human endometrium distinguishes menstrual cycle phases and underlying biological processes in normo-ovulatory women.” Endocrinology 147.3 (2006): 1097-1121.

According to certain aspects, methods of the invention provide for obtaining phase-specific genetic reference data (i.e. signature or regulation pattern) based on data obtained from a number of endometriosis related sources. The data sources may include public and private endometriosis related databases. The reference endometriosis data set may include data obtained from a multitude of patients of similar or diverse background, a variety of sample types, and samples taken over different time points. In certain embodiments, parameters associated with the data set include age, negative/positive diagnosis of endometriosis, stage/type of the disease, pain associated with endometriosis, gravidity/parity, endometrioma position, tissue sampling method, phase of the uterine cycle, and ethnicity.

FIG. 2 provides a schematic illustration of the meta-analysis process. As shown in FIG. 2, microarray data is obtained from several studies that examine gene expression differences between the tissue of patients with endometriosis and the tissue of normal patients (e.g. normal patients). The association between gene expression and endometriosis may be analyzed within each case or by comparing cases and controls using analysis of variance. According to certain embodiments, the micro-array data between studies may differ due to patient variability, tissue type, uterine phase during sample, experimental technique, etc. The micro-array data is entered into the system, processed to normalize the expression data and then subjected to a statistical analysis to identify endometriosis-related expression patterns of statistical significance. Statistical parameters may be chosen to identify gene expression patterns that are statistically significant. Based on the data, the system can also identify statistically significant expression patterns associated with specific phases of the uterine cycle.

Method of logistic regression are described, for example in, Ruczinski (Journal of Computational and Graphical Statistics 12:475-512, 2003); Agresti (An Introduction to Categorical Data Analysis, John Wiley & Sons, Inc., 1996, New York, Chapter 8); and Yeatman et al. (U.S. patent application number 2006/0195269), the content of each of which is hereby incorporated by reference in its entirety.

Other algorithms for analyzing associations are known. For example, the stochastic gradient boosting is used to generate multiple additive regression tree (MART) models to predict a range of outcome probabilities. Each tree is a recursive graph of decisions the possible consequences of which partition patient parameters; each node represents a question (e.g., is the FSH level greater than x?) and the branch taken from that node represents the decision made (e.g. yes or no). The choice of question corresponding to each node is automated. A MART model is the weighted sum of iteratively produced regression trees. At each iteration, a regression tree is fitted according to a criterion in which the samples more involved in the prediction error are given priority. This tree is added to the existing trees, the prediction error is recalculated, and the cycle continues, leading to a progressive refinement of the prediction. The strengths of this method include analysis of many variables without knowledge of their complex interactions beforehand.

A different approach called the generalized linear model, expresses the outcome as a weighted sum of functions of the predictor variables. The weights are calculated based on least squares or Bayesian methods to minimize the prediction error on the training set. A predictor's weight reveals the effect of changing that predictor, while holding the others constant, on the outcome. In cases where one or more predictors are highly correlated, in a phenomenon known as collinearity, the relative values of their weights are less meaningful; steps must be taken to remove that collinearity, such as by excluding the nearly redundant variables from the model. Thus, when properly interpreted, the weights express the relative importance of the predictors. Less general formulations of the generalized linear model include linear regression, multiple regression, and multifactor logistic regression models, and are highly used in the medical community as clinical predictors.

In order to determine expression levels associated with endometriosis that are statistically significant, a series of logistic regression models may be used. The p-values and odds ratio can be used for statistical inference. Logistic regression models are common statistical classification models. The endometriotic expression patterns across the different phases that are statistically significant are considered biomarkers or signatures for the disease.

According to aspects of the invention, the reference phase-specific endometriotic signatures can then be used to identify a patient's phase-specific endometriotic signatures, classify the patient's endometriosis and tailor treatment of the same.

In certain embodiments, the patient's genetic signatures are identified by comparing the patient's expression data across one or more time-points in the uterine cycle to reference phase-specific expression levels. The patient's phase specific genetic signature for endometriosis may include expression levels that are the same as or dissimilar from the reference phase-specific reference data. For example, the reference phase-specific pattern or expression data may represent expression levels of subjects having endometriosis. In such instance, similarities between the patient's expression levels and the reference may be indicative of the patient's phase-specific genetic signature. In another example, the reference phase-specific pattern or expression data may represent expression levels of subjects without endometriosis. In such instance, dissimilarities between the patient's expression levels and the reference may be indicative of the patient's phase-specific genetic signature.

By identifying the patient's phase-specific endometriosis signature, a treatment regimen can be prescribed or set forth in an informative report that is targeted to the patient's signature. For example, a drug or therapeutic that targets the gene or the biochemical pathways associated with the gene may be prescribed. In certain embodiments, the course of treatment is tailored to the patient's expression signatures in each phase. For example, treatment may only be indicated in one of the phases (such as the proliferative phase) or different treatments may be indicated for two or more of the phases. As such, methods of the inventions advantageous inform both timing and type of treatment.

In certain embodiments, the invention provides methods for assessing endometriosis that include determining levels of transcripts present a patient's sample, who is suspected of having endometriosis, identifying those transcript levels that correspond to a regulation pattern specific to a time-point in a uterine cycle and characterizing endometriosis based upon the identified transcript levels. In some embodiments, the time-point of the regulation pattern is a phase of the uterine cycle. The characterization may include determining the phase(s) of the subject's uterine cycle based on the identified transcripts. Additionally, the characterization may include determining the type/stage of the endometriosis based on the identified transcripts. In further embodiments, the method may further include determining the timing or type of treatment for the endometriosis based on the characterization.

Other embodiments involve methods for targeting treatment of endometriosis. For example, some embodiments for targeting the treatment of endometriosis include determining expression levels of one or more genes over different time-points during a subject's uterine cycle, identifying a time point during the uterine cycle in which expression levels are dyssynchronous or dissimilar with respect to a non-endometriotic condition, and informing a course of treatment specific to the subject that coincides with the identified time point. For example, the subject may have differentially expressed genes at a certain phase, in circumstances where a subject's genes are regulation pattern (i.e. upregulated/deregulated) during the proliferative phase is different from the non-endometriodic regulation patterns at the proliferative phase. Treatments may involve a variety of known methods such as hormone therapies (e.g., hormonal contraceptives, gonadotropin-releasing hormone (Gn-RH) agonists and antagonists, Medroxyprogesterone, and Danazol), surgery to remove endometrial tissue, or even hysterectomy.

Further embodiments involve determining phase-specific genetic signatures of a patient across the various phases of the patient's uterine cycle to classify endometriosis. Such methods include determining expression levels of one or more transcripts in a sample obtained from a subject with endometriosis across different time-points of the subject's uterine cycle. The determined transcript levels are then compared to reference transcript levels corresponding to different time-points of the uterine cycle. The reference transcript level may be the consensus expression level of one or more transcripts obtained from a patient population. The patient population chosen for the reference level may be chosen based on certain phenotypic traits—e.g., positive for endometriosis, negative for endometriosis, infertile, fertile, certain age or weight, etc. Based on the comparison, differential transcripts at each time point of the uterine cycle are determined. The differential transcripts at each time point are considered the subject's genetic signature for the respective time points. The subject's genetic signature can then be used to classify endometriosis, e.g., determine the type/stage of the endometriosis and, and to determine a course of treatment specific to the subject's genetic signatures.

In certain aspects, the invention involves assessing transcripts present in a biological sample. Such methods may involve preparing amplified cDNA from total RNA. cDNA is prepared and indiscriminately amplified without diluting the isolated RNA sample or distributing the mixture of genetic material in the isolated RNA into discrete reaction samples. Preferably, amplification is initiated at the 3′ end as well as randomly throughout the whole transcriptome in the sample to allow for amplification of both mRNA and non-polyadenylated transcripts. The double-stranded cDNA amplification products are thus optimized for the generation of sequencing libraries for Next Generation Sequencing platforms. Suitable kits for amplifying cDNA in accordance with the methods of the invention include, for example, the Ovation® RNA-Seq System.

Methods of the invention also involve sequencing the amplified cDNA. While any known sequencing method can be used to sequence the amplified cDNA mixture, single molecule sequencing methods are preferred. Preferably, the amplified cDNA is sequenced by whole transcriptome shotgun sequencing (also referred to herein as (“RNA-Seq”). Whole transcriptome shotgun sequencing (RNA-Seq) can be accomplished using a variety of next-generation sequencing platforms such as the Illumina Genome Analyzer platform, ABI Solid Sequencing platform, or Life Science's 454 Sequencing platform.

Differential transcript levels within the biological sample can also be analyzed using via microarray techniques. The amplified cDNA can be used to probe a microarray containing gene transcripts associated with one or conditions or diseases, such as any prenatal condition, or any type of cancer, inflammatory, or autoimmune disease.

In certain aspects, the invention provides a microarray including a plurality of oligonucleotides attached to a substrate at discrete addressable positions, in which at least one of the oligonucleotides hybridizes to a portion of a gene selected from CCL3L1, CCL3, FAM180A, THBS2, PDGFRL, FN1, CLE11A, CCNA2, KIF20A, BUB1B, HSD17B6, HSD11B1, C7, C3, CXCL2, CXCL12, CXCL13, PDGFC, CXCL14, ACTA2, TAGLN, and SORBS1.

Methods of constructing microarrays are known in the art. See for example Yeatman et al. (U.S. patent application number 2006/0195269), the content of which is hereby incorporated by reference in its entirety.

Microarrays are prepared by selecting probes that include a polynucleotide sequence, and then immobilizing such probes to a solid support or surface. For example, the probes may comprise DNA sequences, RNA sequences, or copolymer sequences of DNA and RNA. The polynucleotide sequences of the probes may also comprise DNA and/or RNA analogues, or combinations thereof. For example, the polynucleotide sequences of the probes may be full or partial fragments of genomic DNA. The polynucleotide sequences of the probes may also be synthesized nucleotide sequences, such as synthetic oligonucleotide sequences. The probe sequences can be synthesized either enzymatically in vivo, enzymatically in vitro (e.g., by PCR), or non-enzymatically in vitro.

The probe or probes used in the methods of the invention are preferably immobilized to a solid support which may be either porous or non-porous. For example, the probes of the invention may be polynucleotide sequences which are attached to a nitrocellulose or nylon membrane or filter covalently at either the 3′ or the 5′ end of the polynucleotide. Such hybridization probes are well known in the art (see, e.g., Sambrook et al., MOLECULAR CLONING—A LABORATORY MANUAL (2ND ED.), Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1989). Alternatively, the solid support or surface may be a glass or plastic surface. In a particularly preferred embodiment, hybridization levels are measured to microarrays of probes consisting of a solid phase on the surface of which are immobilized a population of polynucleotides, such as a population of DNA or DNA mimics, or, alternatively, a population of RNA or RNA mimics. The solid phase may be a nonporous or, optionally, a porous material such as a gel.

In preferred embodiments, a microarray comprises a support or surface with an ordered array of binding (e.g., hybridization) sites or “probes” each representing one of the genes described herein, particularly one of CCL3L1, CCL3, FAM180A, THBS2, PDGFRL, FN1, CLE11A, CCNA2, KIF20A, BUB1B, HSD17B6, HSD11B1, C7, C3, CXCL2, CXCL12, CXCL13, PDGFC, CXCL14, ACTA2, TAGLN, and SORBS1. Preferably the microarrays are addressable arrays, and more preferably positionally addressable arrays. More specifically, each probe of the array is preferably located at a known, predetermined position on the solid support such that the identity (i.e., the sequence) of each probe can be determined from its position in the array (i.e., on the support or surface). In preferred embodiments, each probe is covalently attached to the solid support at a single site.

Microarrays can be made in a number of ways, of which several are described below. However produced, microarrays share certain characteristics. The arrays are reproducible, allowing multiple copies of a given array to be produced and easily compared with each other. Preferably, microarrays are made from materials that are stable under binding (e.g., nucleic acid hybridization) conditions. The microarrays are preferably small, e.g., between 1 cm² and 25 cm², between 12 cm² and 13 cm², or 3 cm². However, larger arrays are also contemplated and may be preferable, e.g., for use in screening arrays. Preferably, a given binding site or unique set of binding sites in the microarray will specifically bind (e.g., hybridize) to the product of a single gene in a cell (e.g., to a specific mRNA, or to a specific cDNA derived therefrom). However, in general, other related or similar sequences will cross hybridize to a given binding site.

The microarrays of the present invention include one or more test probes, each of which has a polynucleotide sequence that is complementary to a subsequence of RNA or DNA to be detected. Preferably, the position of each probe on the solid surface is known. Indeed, the microarrays are preferably positionally addressable arrays. Specifically, each probe of the array is preferably located at a known, predetermined position on the solid support such that the identity (i.e., the sequence) of each probe can be determined from its position on the array (i.e., on the support or surface).

According to the invention, the microarray is an array (i.e., a matrix) in which each position represents one of the biomarkers described herein. For example, each position can contain a DNA or DNA analogue based on genomic DNA to which a particular RNA or cDNA transcribed from that genetic marker can specifically hybridize. The DNA or DNA analogue can be, e.g., a synthetic oligomer or a gene fragment. In one embodiment, probes representing each of the markers are present on the array. In certain embodiments, the array comprises probes for genes known to be associated with endometriosis. In addition, the array probes may be specific to genes known to be associated with endometriosis at a certain phase of the uterine cycle.

As noted above, the probe to which a particular polynucleotide molecule specifically hybridizes according to the invention contains a complementary genomic polynucleotide sequence. The probes of the microarray preferably consist of nucleotide sequences of no more than 1,000 nucleotides. In some embodiments, the probes of the array consist of nucleotide sequences of 10 to 1,000 nucleotides. In a preferred embodiment, the nucleotide sequences of the probes are in the range of 10-200 nucleotides in length and are genomic sequences of a species of organism, such that a plurality of different probes is present, with sequences complementary and thus capable of hybridizing to the genome of such a species of organism, sequentially tiled across all or a portion of such genome. In other specific embodiments, the probes are in the range of 10-30 nucleotides in length, in the range of 10-40 nucleotides in length, in the range of 20-50 nucleotides in length, in the range of 40-80 nucleotides in length, in the range of 50-150 nucleotides in length, in the range of 80-120 nucleotides in length, and most preferably are 60 nucleotides in length.

The probes may comprise DNA or DNA “mimics” (e.g., derivatives and analogues) corresponding to a portion of an organism's genome. In another embodiment, the probes of the microarray are complementary RNA or RNA mimics. DNA mimics are polymers composed of subunits capable of specific, Watson-Crick-like hybridization with DNA, or of specific hybridization with RNA. The nucleic acids can be modified at the base moiety, at the sugar moiety, or at the phosphate backbone. Exemplary DNA mimics include, e.g., phosphorothioates.

DNA can be obtained, e.g., by polymerase chain reaction (PCR) amplification of genomic DNA or cloned sequences. PCR primers are preferably chosen based on a known sequence of the genome that will result in amplification of specific fragments of genomic DNA. Computer programs that are well known in the art are useful in the design of primers with the required specificity and optimal amplification properties, such as Oligo version 5.0 (National Biosciences). Typically each probe on the microarray will be between 10 bases and 50,000 bases, usually between 300 bases and 1,000 bases in length. PCR methods are well known in the art, and are described, for example, in Innis et al., eds., PCR PROTOCOLS: A GUIDE TO METHODS AND APPLICATIONS, Academic Press Inc., San Diego, Calif. (1990). It will be apparent to one skilled in the art that controlled robotic systems are useful for isolating and amplifying nucleic acids.

An alternative, preferred means for generating the polynucleotide probes of the microarray is by synthesis of synthetic polynucleotides or oligonucleotides, e.g., using N-phosphonate or phosphoramidite chemistries (Froehler et al., Nucleic Acid Res. 14:5399-5407 (1986); McBride et al., Tetrahedron Lett. 24:246-248 (1983)). Synthetic sequences are typically between about 10 and about 500 bases in length, more typically between about 20 and about 100 bases, and most preferably between about 40 and about 70 bases in length. In some embodiments, synthetic nucleic acids include non-natural bases, such as, but by no means limited to, inosine. As noted above, nucleic acid analogues may be used as binding sites for hybridization. An example of a suitable nucleic acid analogue is peptide nucleic acid (see, e.g., Egholm et al., Nature 363:566-568 (1993); U.S. Pat. No. 5,539,083).

Probes are preferably selected using an algorithm that takes into account binding energies, base composition, sequence complexity, cross-hybridization binding energies, and secondary structure. See Friend et al., International Patent Publication WO 01/05935, published Jan. 25, 2001; Hughes et al., Nat. Biotech. 19:342-7 (2001).

A skilled artisan will also appreciate that positive control probes, e.g., probes known to be complementary and hybridizable to sequences in the target polynucleotide molecules, and negative control probes, e.g., probes known to not be complementary and hybridizable to sequences in the target polynucleotide molecules, should be included on the array. In one embodiment, positive controls are synthesized along the perimeter of the array. In another embodiment, positive controls are synthesized in diagonal stripes across the array. In still another embodiment, the reverse complement for each probe is synthesized next to the position of the probe to serve as a negative control. In yet another embodiment, sequences from other species of organism are used as negative controls or as “spike-in” controls.

The probes are attached to a solid support or surface, which may be made, e.g., from glass, plastic (e.g., polypropylene, nylon), polyacrylamide, nitrocellulose, gel, or other porous or nonporous material. A preferred method for attaching the nucleic acids to a surface is by printing on glass plates, as is described generally by Schena et al, Science 270:467-470 (1995). This method is especially useful for preparing microarrays of cDNA (See also, DeRisi et al, Nature Genetics 14:457-460 (1996); Shalon et al., Genome Res. 6:639-645 (1996); and Schena et al., Proc. Natl. Acad. Sci. U.S.A. 93:10539-11286 (1995)).

A second preferred method for making microarrays is by making high-density oligonucleotide arrays. Techniques are known for producing arrays containing thousands of oligonucleotides complementary to defined sequences, at defined locations on a surface using photolithographic techniques for synthesis in situ (see, Fodor et al., 1991, Science 251:767-773; Pease et al., 1994, Proc. Natl. Acad. Sci. U.S.A. 91:5022-5026; Lockhart et al., 1996, Nature Biotechnology 14:1675; U.S. Pat. Nos. 5,578,832; 5,556,752; and 5,510,270) or other methods for rapid synthesis and deposition of defined oligonucleotides (Blanchard et al., Biosensors & Bioelectronics 11:687-690). When these methods are used, oligonucleotides (e.g., 60-mers) of known sequence are synthesized directly on a surface such as a derivatized glass slide. Usually, the array produced is redundant, with several oligonucleotide molecules per RNA.

Other methods for making microarrays, e.g., by masking (Maskos and Southern, 1992, Nuc. Acids. Res. 20:1679-1684), may also be used. In principle, and as noted supra, any type of array, for example, dot blots on a nylon hybridization membrane (see Sambrook et al., MOLECULAR CLONING—A LABORATORY MANUAL (2ND ED.), Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1989)) could be used. However, as will be recognized by those skilled in the art, very small arrays will frequently be preferred because hybridization volumes will be smaller.

In one embodiment, the arrays of the present invention are prepared by synthesizing polynucleotide probes on a support. In such an embodiment, polynucleotide probes are attached to the support covalently at either the 3′ or the 5′ end of the polynucleotide.

In a particularly preferred embodiment, microarrays of the invention are manufactured by means of an ink jet printing device for oligonucleotide synthesis, e.g., using the methods and systems described by Blanchard in U.S. Pat. No. 6,028,189; Blanchard et al., 1996, Biosensors and Bioelectronics 11:687-690; Blanchard, 1998, in Synthetic DNA Arrays in Genetic Engineering, Vol. 20, J. K. Setlow, Ed., Plenum Press, New York at pages 111-123. Specifically, the oligonucleotide probes in such microarrays are preferably synthesized in arrays, e.g., on a glass slide, by serially depositing individual nucleotide bases in “microdroplets” of a high surface tension solvent such as propylene carbonate. The microdroplets have small volumes (e.g., 100 pL or less, more preferably 50 pL or less) and are separated from each other on the microarray (e.g., by hydrophobic domains) to form circular surface tension wells, which define the locations of the array elements (i.e., the different probes). Microarrays manufactured by this ink-jet method are typically of high density, preferably having a density of at least about 2,500 different probes per 1 cm.sup.2. The polynucleotide probes are attached to the support covalently at either the 3′ or the 5′ end of the polynucleotide.

The polynucleotide molecules which may be analyzed by the present invention are DNA, RNA, or protein. The target polynucleotides are detectably labeled at one or more nucleotides. Any method known in the art may be used to detectably label the target polynucleotides. Preferably, this labeling incorporates the label uniformly along the length of the DNA or RNA, and more preferably, the labeling is carried out at a high degree of efficiency.

In a preferred embodiment, the detectable label is a luminescent label. For example, fluorescent labels, bioluminescent labels, chemiluminescent labels, and colorimetric labels may be used in the present invention. In a highly preferred embodiment, the label is a fluorescent label, such as a fluorescein, a phosphor, a rhodamine, or a polymethine dye derivative. Examples of commercially available fluorescent labels include, for example, fluorescent phosphoramidites such as FluorePrime (Amersham Pharmacia, Piscataway, N.J.), Fluoredite (Millipore, Bedford, Mass.), FAM (ABI, Foster City, Calif.), and Cy3 or Cy5 (Amersham Pharmacia, Piscataway, N.J.). In another embodiment, the detectable label is a radiolabeled nucleotide.

In a further preferred embodiment, target polynucleotide molecules from a patient sample are labeled differentially from target polynucleotide molecules of a reference sample. The reference can comprise target polynucleotide molecules from normal tissue samples.

Nucleic acid hybridization and wash conditions are chosen so that the target polynucleotide molecules specifically bind or specifically hybridize to the complementary polynucleotide sequences of the array, preferably to a specific array site, wherein its complementary DNA is located.

Arrays containing double-stranded probe DNA situated thereon are preferably subjected to denaturing conditions to render the DNA single-stranded prior to contacting with the target polynucleotide molecules. Arrays containing single-stranded probe DNA (e.g., synthetic oligodeoxyribonucleic acids) may need to be denatured prior to contacting with the target polynucleotide molecules, e.g., to remove hairpins or dimers which form due to self complementary sequences.

Optimal hybridization conditions will depend on the length (e.g., oligomer versus polynucleotide greater than 200 bases) and type (e.g., RNA, or DNA) of probe and target nucleic acids. One of skill in the art will appreciate that as the oligonucleotides become shorter, it may become necessary to adjust their length to achieve a relatively uniform melting temperature for satisfactory hybridization results. General parameters for specific (i.e., stringent) hybridization conditions for nucleic acids are described in Sambrook et al., MOLECULAR CLONING—A LABORATORY MANUAL (2ND ED.), Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1989), and in Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, vol. 2, Current Protocols Publishing, New York (1994). Typical hybridization conditions for the cDNA microarrays of Schena et al. are hybridization in 5×SSC plus 0.2% SDS at 65° C. for four hours, followed by washes at 25° C. in low stringency wash buffer (1×SSC plus 0.2% SDS), followed by 10 minutes at 25° C. in higher stringency wash buffer (0.1×SSC plus 0.2% SDS) (Schena et al., Proc. Natl. Acad. Sci. U.S.A. 93:10614 (1993)). Useful hybridization conditions are also provided in, e.g., Tijessen, 1993, HYBRIDIZATION WITH NUCLEIC ACID PROBES, Elsevier Science Publishers B. V.; and Kricka, 1992, NONISOTOPIC DNA PROBE TECHNIQUES, Academic Press, San Diego, Calif.

Particularly preferred hybridization conditions include hybridization at a temperature at or near the mean melting temperature of the probes (e.g., within 51° C., more preferably within 21° C.) in 1 M NaCl, 50 mM MES buffer (pH 6.5), 0.5% sodium sarcosine and 30% formamide.

When fluorescently labeled genes or gene products are used, the fluorescence emissions at each site of a microarray may be, preferably, detected by scanning confocal laser microscopy. In one embodiment, a separate scan, using the appropriate excitation line, is carried out for each of the two fluorophores used. Alternatively, a laser may be used that allows simultaneous specimen illumination at wavelengths specific to the two fluorophores and emissions from the two fluorophores can be analyzed simultaneously (see Shalon et al., 1996, “A DNA microarray system for analyzing complex DNA samples using two-color fluorescent probe hybridization,” Genome Research 6:639-645, which is incorporated by reference in its entirety for all purposes). In a preferred embodiment, the arrays are scanned with a laser fluorescent scanner with a computer controlled X-Y stage and a microscope objective. Sequential excitation of the two fluorophores is achieved with a multi-line, mixed gas laser and the emitted light is split by wavelength and detected with two photomultiplier tubes. Fluorescence laser scanning devices are described in Schena et al., Genome Res. 6:639-645 (1996), and in other references cited herein. Alternatively, the fiber-optic bundle described by Ferguson et al., Nature Biotech. 14:1681-1684 (1996), may be used to monitor mRNA abundance levels at a large number of sites simultaneously.

In the study discussed in example 3 below, among PCOS patients, obesity had significant negative effects on implantation rate by odds ratio, or OR (<50%, OR=0.55, p=0.02), clinical pregnancy (OR=0.57, p=0.03) and live birth (OR=0.44, p=0.02) outcome while no significant adverse effects from obesity were determined for other patient groups (i.e., diminished ovarian reserve, endometriosis, idiopathic, male factor, PCOS, and tubal factor). FIG. 11 illustrates the percentage of cohort for patient groups in the study. FIG. 12 illustrates methods for analyzing multi-cycle IVF data including the method used in the study discussed below where generalized estimation equation (GEE) is used to calculate patient-level distributions using all available patient-specific IVF cycle data.

For PCOS patients, obesity increases the risk of IVF treatment failure over two-fold and, specifically, obesity was found to adversely affect implantation rate, clinical pregnancy and live birth outcomes, obesity was found to have a negative influence on uterine receptivity and embryo implantation for PCOS patients. FIG. 13 illustrates the impact of obesity on oocyte retrieval, number of viable embryos, implantation rate, and live birth outcomes for patient groups including patients suffering from diminished ovarian reserve, endometriosis, idiopathic, male factor, PCOS, and tubal factor showing a significant decrease in implantation rate and number of live birth outcomes for obese PCOS patients.

Methods of the invention include determining a likelihood of IVF treatment success for a patient or individual based on a PCOS diagnosis and a measure of obesity. Body fat may be indicated by weight, waist circumference (e.g., the circumference of the abdomen, measured at the natural waist (in between the lowest rib and the top of the hip bone), the umbilicus (belly button), or at the narrowest point of the midsection), waist-to-hip ratio (e.g., calculated by measuring the waist and the hip (at the widest diameter of the buttocks), and then dividing the waist measurement by the hip measurement), skinfold thickness (e.g., using a special caliper to measure the thickness of a “pinch” of skin and the fat beneath it in specific areas of the body and using equations to predict body fat percentage based on these measurements), bioelectrical impedance (see, Hu F. Measurements of Adiposity and Body Composition. In: Hu F, ed. Obesity Epidemiology. New York City: Oxford University Press, 2008; 53-83, incorporated herein in its entirety), underwater weighing (densitometry), air-displacement plethysmography, dilution method (magnetic resonance imaging, or dual energy X-ray absorptiometry. In a preferred embodiment, body fat is indicated by body mass index (BMI). BMI is the ratio of weight to height, calculated as weight (kg)/height (m2), or weight (lb)/height (in2) multiplied by 703.

Body fat, as measured by one of the methods above, can then be compared to a reference number to determine if the individual is obese. Diminished success rates for PCOS diagnosed individuals may be indicated where, for instance, BMI is determined to be greater than 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, or 35 kg/m². In a preferred embodiment, a BMI greater than 30 kg/m² is considered to obese and at higher risk of IVF failure.

In certain embodiments, systems and methods of the invention include receiving a PCOS diagnosis for a patient. In other embodiments, systems and methods may include diagnosing PCOS in an individual through, for example, one of the methods described in Sheehan, Polycystic Ovarian Syndrome: Diagnosis and Management, Clin Med Res. 2004 February; 2(1): 13-27.

After obtaining or determining a PCOS diagnosis and an indication of obesity in an individual, methods of the invention may include determining a likelihood of IVF success for the individual (e.g., represented by a % score indicating a likelihood of live birth after IVF treatment, a % reduction from average success rate, or an estimated number of cycles to achieve live birth). The likelihood of IVF success rate may be reported to the individual or the individual's physician alone or in combination with other fertility information

In certain embodiments, the PCOS and obesity information may be combined with other factors to determine an overall likelihood of IVF success for the patient or to provide treatment recommendations for the patient.

In the study discussed in example 4 below, a large cohort of retrospective pre-implantation genetic screening (PGS) data was studied to clarify the respective contributions of FSH and age to aneuploidy. While no age-independent association between FSH and aneuploidy odds was found, the age-associated increase in aneuploidy odds was more pronounced in patients with FSH levels above 13 mUI/mL where odds of aneuploidy increased at a substantially higher rate (50%) for each additional year (OR=1.52, p<0.0001) of life.

Methods of the invention include determining a woman's relative risk of producing an aneuploid embryo based upon her age and her FSH level. FSH level may be determined from a body fluid such as urine or blood. A sample may be obtained directly from the patient or may be received. Because urine levels of FSH vary throughout the day, in certain embodiments, urine may be collected over a 24-hour period before FSH levels are determined. FSH levels in the sample may be determined using a laboratory procedure such as an immunofluorometric assay. See Kesner J S, Knecht E A, Krieg E F., Jr Time-resolved immunofluorometric assays for urinary luteinizing hormone and follicle stimulating hormone. Anal Chim Acta. 1994; 285:13-22 incorporated herein by reference in its entirety.

A greater increase, by age, of aneuploidy rates may be indicated where the FSH level for a woman is greater than as threshold level such as, for example, 10, 11, 12, 13, 14, or 15 mUI/ML. In a preferred embodiment, where a woman's FSH level is above 13 mUI/mL, she may be at an increased risk of producing aneuploid embryos as she ages. In various embodiments, the woman's risk of producing an aneuploid embryo may be determined from her FSH levels and her age above puberty or fertility. Where the woman's FSH level is below the threshold level, the risk may be increased by 8%, 9%, 10%, 11%, or 12% (from an initial or base risk level) for each year of her reproductive lifespan (e.g., time from beginning of puberty to menopause, or from beginning of regular ovulation to menopause). Where the woman's FSH level is above the threshold level, the risk may be increased by 13%, 14%, 15%, 16%, or 17% (from an initial or base risk level) for each year of her reproductive lifespan. In certain instances, reproductive lifespan may be determined for an individual based on the actual age they reached puberty or began regular ovulation (as determined, for example, by a detailed patient history) or may be assumed to have begun at standard age such as 12, 13, 14, 15, 16, or 17. An initial or base risk of producing an aneuploid embryo may be determined from an average rate among the population or taken from known studies such as Franasiak, et al., The nature of aneuploidy with increasing age of the female partner: a review of 15,169 consecutive trophectoderm biopsies evaluated with comprehensive chromosomal screening, Fertil Steril. 2014 March; 101(3):656-663; incorporated herein in its entirety.

In certain embodiments, systems and methods of the invention may include reporting to this increased risk to the patient, physician, or other individual, where the patient's FSH level is greater than 13 mUI/mL. Various embodiments may include recommending or performing a treatment for the patient including avoiding certain assistive reproductive technologies, beginning treatments earlier, or, in some cases, harvesting eggs or embryos and storing for later use in assistive reproductive technologies such as IVF. Methods for retrieving and/or storing eggs and embryos are known. See Cil, et al., Current trends and progress in clinical applications of oocyte cryopreservation, Curr Opin Obstet Gynecol. 2013 June; 25(3); Killick, S (2006). “Ultrasound and fertility”. In Bates, J. Practical gynaecological ultrasound (2nd ed.). Cambridge, England: Cambridge University Press. pp. 120-5; the contents of which are incorporated herein in their entirety.

Reports as referred to herein may be produced in written form on paper or in a computer file and may be prepared by a computing device and sent to a user (e.g., patient, physician or other individual) through an input/output device such as a monitor, interactive display, or printer, for example.

Methods of the invention may be performed using any type of computing device, such as a computer, that includes a processor, e.g., a central processing unit, or any combination of computing devices where each device performs at least part of the process or method. In some embodiments, systems and methods described herein may be performed with a handheld device, e.g., a smart tablet, or a smart phone, or a specialty device produced for the system.

Methods of the invention can be performed using software, hardware, firmware, hardwiring, or combinations of any of these. Features implementing functions can also be physically located at various positions, including being distributed such that portions of functions are implemented at different physical locations (e.g., imaging apparatus in one room and host workstation in another, or in separate buildings, for example, with wireless or wired connections).

Processors suitable for the execution of computer program include, by way of example, both general and special purpose microprocessors, and any one or more processor of any kind of digital computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both. The essential elements of computer are a processor for executing instructions and one or more memory devices for storing instructions and data. Generally, a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto-optical disks, or optical disks. Information carriers suitable for embodying computer program instructions and data include all forms of non-volatile memory, including by way of example semiconductor memory devices, (e.g., EPROM, EEPROM, solid state drive (SSD), and flash memory devices); magnetic disks, (e.g., internal hard disks or removable disks); magneto-optical disks; and optical disks (e.g., CD and DVD disks). The processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.

To provide for interaction with a user, the subject matter described herein can be implemented on a computer having an I/O device, e.g., a CRT, LCD, LED, or projection device for displaying information to the user and an input or output device such as a keyboard and a pointing device, (e.g., a mouse or a trackball), by which the user can provide input to the computer. Other kinds of devices can be used to provide for interaction with a user as well. For example, feedback provided to the user can be any form of sensory feedback, (e.g., visual feedback, auditory feedback, or tactile feedback), and input from the user can be received in any form, including acoustic, speech, or tactile input.

The subject matter described herein can be implemented in a computing system that includes a back-end component (e.g., a data server), a middleware component (e.g., an application server), or a front-end component (e.g., a client computer having a graphical user interface or a web browser through which a user can interact with an implementation of the subject matter described herein), or any combination of such back-end, middleware, and front-end components. The components of the system can be interconnected through network by any form or medium of digital data communication, e.g., a communication network. For example, the reference set of data may be stored at a remote location and the computer communicates across a network to access a reference set of data for all patients along with clinical outcomes (e.g., IVF success rates) to compare data derived from the female subject to the reference set. In other embodiments, however, the reference set is stored locally within the computer and the computer accesses the reference set within the CPU to compare subject data to the reference set. Examples of communication networks include cell network (e.g., 3G or 4G), a local area network (LAN), and a wide area network (WAN), e.g., the Internet.

The subject matter described herein can be implemented as one or more computer program products, such as one or more computer programs tangibly embodied in an information carrier (e.g., in a non-transitory computer-readable medium) for execution by, or to control the operation of, data processing apparatus (e.g., a programmable processor, a computer, or multiple computers). A computer program (also known as a program, software, software application, app, macro, or code) can be written in any form of programming language, including compiled or interpreted languages (e.g., C, C++, Perl), and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. Systems and methods of the invention can include instructions written in any suitable programming language known in the art, including, without limitation, C, C++, Perl, Java, ActiveX, HTML5, Visual Basic, or JavaScript.

A computer program does not necessarily correspond to a file. A program can be stored in a file or a portion of file that holds other programs or data, in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub-programs, or portions of code). A computer program can be deployed to be executed on one computer or on multiple computers at one site or distributed across multiple sites and interconnected by a communication network.

A file can be a digital file, for example, stored on a hard drive, SSD, CD, or other tangible, non-transitory medium. A file can be sent from one device to another over a network (e.g., as packets being sent from a server to a client, for example, through a Network Interface Card, modem, wireless card, or similar).

Writing a file according to the invention involves transforming a tangible, non-transitory computer-readable medium, for example, by adding, removing, or rearranging particles (e.g., with a net charge or dipole moment into patterns of magnetization by read/write heads), the patterns then representing new collocations of information about objective physical phenomena desired by, and useful to, the user. In some embodiments, writing involves a physical transformation of material in tangible, non-transitory computer readable media (e.g., with certain optical properties so that optical read/write devices can then read the new and useful collocation of information, e.g., burning a CD-ROM). In some embodiments, writing a file includes transforming a physical flash memory apparatus such as NAND flash memory device and storing information by transforming physical elements in an array of memory cells made from floating-gate transistors. Methods of writing a file are well-known in the art and, for example, can be invoked manually or automatically by a program or by a save command from software or a write command from a programming language.

Suitable computing devices typically include mass memory, at least one graphical user interface, at least one display device, and typically include communication between devices. The mass memory illustrates a type of computer-readable media, namely computer storage media. Computer storage media may include volatile, nonvolatile, removable, and non-removable media implemented in any method or technology for storage of information, such as computer readable instructions, data structures, program modules, or other data. Examples of computer storage media include RAM, ROM, EEPROM, flash memory, or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, Radiofrequency Identification tags or chips, or any other medium which can be used to store the desired information and which can be accessed by a computing device.

As one skilled in the art would recognize as necessary or best-suited for performance of the methods of the invention, a computer system or machines of the invention include one or more processors (e.g., a central processing unit (CPU) a graphics processing unit (GPU) or both), a main memory and a static memory, which communicate with each other via a bus.

In an exemplary embodiment shown in FIG. 14, system 200, capable of carrying out methods of the invention, can include a computer 249 (e.g., laptop, desktop, or tablet). The computer 249 may be configured to communicate across a network 209. Computer 249 includes one or more processor 259 and memory 263 as well as an input/output mechanism 254. Where methods of the invention employ a client/server architecture, an steps of methods of the invention may be performed using server 213, which includes one or more of processor 221 and memory 229, capable of obtaining data, instructions, etc., or providing results via interface module 225 or providing results as a file 217. Server 213 may be engaged over network 209 through computer 249 or terminal 267, or server 213 may be directly connected to terminal 267, including one or more processor 275 and memory 279, as well as input/output mechanism 271.

Systems 200 or machines according to the invention may further include, for any of I/O 259 or 237, or interface module 225, a video display unit (e.g., a liquid crystal display (LCD) or a cathode ray tube (CRT)). Computer systems or machines according to the invention can also include an alphanumeric input device (e.g., a keyboard), a cursor control device (e.g., a mouse), a disk drive unit, a signal generation device (e.g., a speaker), a touchscreen, an accelerometer, a microphone, a cellular radio frequency antenna, and a network interface device, which can be, for example, a network interface card (NIC), Wi-Fi card, or cellular modem.

Memory 263, 279, or 229 according to the invention can include a machine-readable medium on which is stored one or more sets of instructions (e.g., software) embodying any one or more of the methodologies or functions described herein. The software may also reside, completely or at least partially, within the main memory and/or within the processor during execution thereof by the computer system, the main memory and the processor also constituting machine-readable media. The software may further be transmitted or received over a network via the network interface device.

It will be understood that any portion of the systems and methods disclosed herein, can be implemented by computer, including the devices described above. Information is collected from a female subject. This data is then inputted into the central processing unit (CPU) of a computer. The CPU is coupled to a storage or memory for storing instructions for implementing methods of the present invention. The instructions, when executed by the CPU, cause the CPU to provide a probability of successful in vitro fertilization in a selected cycle of in vitro fertilization. The CPU provides this determination by inputting the subject data into an algorithm trained on a reference set of data from a plurality of women for whom fertility-associated phenotypic traits and pregnancy outcomes for each cycle of IVF is known. The reference set of data may be stored locally within the computer, such as within the computer memory. Alternatively, the reference set may be stored in a location that is remote from the computer, such as a server. In this instance, the computer communicates across a network to access the reference set of data. The CPU then provides a probability of achieving pregnancy at a selected point in time based on the data entered into the algorithm.

INCORPORATION BY REFERENCE

References and citations to other documents, such as patents, patent applications, patent publications, journals, books, papers, web contents, have been made throughout this disclosure. All such documents are hereby incorporated herein by reference in their entirety for all purposes.

EQUIVALENTS

The invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The foregoing embodiments are therefore to be considered in all respects illustrative rather than limiting on the invention described herein. Scope of the invention is thus indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore

Example 1

Several studies have compared the gene expression signatures of tissue from normal and endometriosis patients, identifying significant differences in the expression of particular functional pathways, such as focal adhesion, tissue remodeling and immune response.

However, there is often discrepancy in the identity of the genes themselves, most likely being a product of inter-experimental patient variability, tissue type, cohort size, experimental technique, significance thresholds etc. To help more faithfully define of the gene expression signature consistently associated with endometriosis, a meta-analysis of microarray data is drawn from several published papers.

The aim of the meta-analysis is to determine whether comparison of this signature with patient specific gene expression data leads to the identification genes whose differential expression is derived from patient-specific genetic variation, as opposed to those whose expression changes merely as a product of endometriosis.

Methodology:

-   -   Datasets:         -   GSE23339 (Hawkins et al., 2011): ovarian endometrioma vs             eutopic endometrium from a *separate* group of normal             patients.         -   GSE7846, GSE6364 (Sha et al., 2007; Burney et al., 2007):             eutopic endometrium from endometriosis patients vs separate             group of normal patients.     -   A normalized expression matrix was computed for each study: For         Affymetrix data, RMA normalization was used (Rafael et al.,         2003). For Illumina, the log 2 normalized values reported in the         original publications were used.     -   QC metrics were calculated using Bioconductor packages         (Gentleman et al., 2004) such as arrayQualityMetrics and         affyQCReport.     -   The case control analyses were performed within each study using         an empirical Bayes moderated t-test for each transcript. Results         were combined across studies using a fixed effects         meta-analysis, combining transcripts for a given gene across         studies, and weighting by the inverse of the variances estimates         for each transcript. The meta-analysis thus yielded a consensus         mean, associated standard error, t-score, and p-value for each         gene. False discovery rates were estimated using standard         methods.     -   Pathway analysis was performed using SPIA (signaling pathway         impact analysis). Gene ontology analysis was performed using         both i) a Fisher's Exact test on the counts of significant genes         (p<0.005) in and not in a given GO term, and ii) a Wilcoxon Rank         Sum test on the gene specific t scores comparing genes in and         not in a given GO term.

Results:

-   -   See Pathways List (below): It appears that the more ‘specific’         the pathway categories are, the lower the magnitude of the         difference between endometriosis and normal sample gene         expression. This might suggest that endometriosis is a disease         of ‘many genes’, as opposed to a handful of specific drivers.         This would fit with its heterogeneous etiology, as well as the         fact that studies do not seem to reach a consensus on the         endometriosis expression signature.     -   In accordance with existing literature, pathway categories with         the highest magnitude include chemokine/cytokine signaling and         other immune response mechanisms, focal adhesion, extra-cellular         matrix interactions, and angiogenesis (See ‘Pathways’ list.)     -   Similarly, among the list of genes (see below) whose expression         is significantly different in association with endometriosis,         the upregulation of CCL3L1, CCL3, FAM180A, THBS2, PDGFRL, FN1,         CLEC11A all reflect the invasive tissue remodeling and         immune-response associated with the development of         endometriosis.     -   There are another two interesting pathways that are identified         as significant but that have been less discussed in previous         studies. These might be worth more careful consideration:

1. Leukocyte Transendothelial Migration.

-   -   Most likely, this is detected as significant at least partly due         to the presence of immunocytes in endometriotic tissue. However,         while previous studies have argued that endometriosis is         invasive but not strictly metastatic, the expression of a number         of the genes in the LTM pathway is not specific to leukocytes or         leukocyte migration:     -   ACTG1, Claudin-4 and EZR, among others, have all been implicated         in the mediation of cell motility in different cancer types.     -   Furthermore, among genes whose expression is most altered in the         endometriotic state, 2 of the most upregulated genes include         matrix metalloproteinase 23B (MMP23B, 2.029 fold increase,         confidence=11.03) and thrombospondin 2 (THBS2, 2.012 fold         increase, confidence=5.037).     -   MMP23 has been found to be predominantly expressed in         reproductive tissues (Velasco et al., 1999). Degradation of ECM         is essential for cells to invade the peritoneum. MMPs are         involved in the breakdown of extracellular matrix during         embryonic development and reproduction, as well as arthritis and         metastasis (reviewed by Christiane et al., 2014).     -   THBS2 modulates cell-matrix interactions, and interestingly, is         involved with integrin aVB3 in the modulation of cell spreading         and migration of endothelial cells (Bornstein et al., 2000).     -   Similarly, OSR2 is significantly downregulated (0.712 fold,         confidence=12.65). TGF-beta1-mediated downregulation of this         transcription factor is associated with the induction of cell         migration (Kawai et al., 2012). TGFB1 is actually enriched at         sites of ectopic endometrium (Komiyana et al., 2007; Medina and         Lebovic, 2010).

2. Axon Guidance/Semaphorin Interactions.

-   -   Biological neuronal markers indicate specific types of nerve         fibers present within endometrial layers (Tokushige et al.,         2006).     -   There is some literature describing nerve fiber density         differences between women with diagnosed endometriosis and women         without endometriosis but no indication as to a driving         mechanism.     -   Our identification of significant gene expression would be (at         least among) the first to suggest a mechanism by which this         increased innervation might take place.     -   Our analysis reveals that a number of genes involved in axon         guidance are significantly upregulated in association with         endometriosis, eg:     -   ROBO3, which competes with ROBO1/2, representing an         anti-repulsion mechanism;     -   SEMA7A, in the absence of which neuroendocrine cell migration is         impaired in mice (Messina et al., 2011).         Gene List for Example 1: Gene Expression Signatures Associated         with Endometriosis

Gene Link Degree of Dif Confidence AKA Details 326 SPRR2F SPRR2F 6.83 4.012 Involved in the inflammatory stress response (Gleyzer and Scarpulla, 2013) Possibly endometrium specific (Contreras et al., 2010) 366 CCL3L1 CCL3L1 3.864 3.037 Cytokines are secreted proteins that function in inflammatory and immunoregulatory processes, via their interaction with several chemokine receptors, including chemokine binding protein 2 and chemokine (C-C motif) receptor 5 (CCR5). The copy number of this gene varies among individuals, where most individuals have one to six copies, and a minority of individuals have zero or more than six copies. There are conflicting reports about copy number variation of this gene and its correlation to disease susceptibility. 364 CCL3 CCL3 3.697 3.161 This locus represents a small inducible cytokine. The encoded protein, also known as macrophage inflammatory protein 1 alpha, plays a role in inflammatory responses through binding to the receptors CCR1, CCR4 and CCR5. 372 HLA- HLA- 3.461 2.491 Expressed in antigen DRB3 DRB3 presenting cells - constitutes part of the histocompatibility complex. 1 LOC100240735 LOC100240735 3.431 16 322 ADAMTS9- ADAMTS9- 3.429 4.047 AS1 AS1 70 FAM180A FAM180A 2.898 6.725 Expression is TGF-B dependent in mammalian systems (Kosla et al., 2013) 361 NCF1 NCF1 2.487 3.229 Neutrophil cytosolic fator - reflecting the accumulation of white blood cells, commonly associated with ednometriosis. 15 DACT1 DACT1 2.155 11.802 Dishevelled signaling mediator - antoagonist of beta-catenin, therefore potentially contributing to the ‘metastatic’ pathology of endometriosis, through mediating loss of adhesion? 175 ITLN2 ITLN2 2.122 5.107 Possibly involved in immune response? 227 C3 C3 2.059 4.682 Complement component 3 - complement activation. 23 REP15 REP15 2.047 9.911 17 MMP23B MMP23B 2.029 11.031 A metallopeptidase involved in the breakdown of extracellular matrix during embryonic development and reproduction, as well as arthritis and metastasis . . . Degradation of ECM is essential for cells to invade the peritoneum 12 CFH CFH 2.025 12.259 Member of the Regulator of Complement Activation (RCA) gene cluster. Encodes a protein with twenty short consensus repeat (SCR) domains. Secreted into the bloodstream and has an essential role in the regulation of complement activation, restricting this innate defense mechanism to microbial infections. 181 THBS2 THBS2 2.012 5.037 Modulates cell-matrix interactions, and interestingly, is involved with integrin aVB3 in the modulation of endothelial cell properties like cell spreading and migration (Bornstein et al., 2000) 101 IGDCC4 IGDCC4 1.945 6.109 Immunoglobulin superfamily 375 FLJ41200 FLJ41200 1.945 2.259 54 LOC100506700 LOC100506700 1.714 7.257 204 PDGFRL PDGFRL 1.682 4.87 11 FAM101B FAM101B 1.671 12.474 2 FN1 FN1 1.668 16 Fibronectin Cell adhesion, migration, embryogenesis, wound healing, blood coagulation, host defense 355 SCG5 SCG5 1.662 3.492 3 CLEC11A CLEC11A 1.635 16 C- Growth factor for type primitive hematopoietic lectin progenitor cells domain family 11, 346 HIST2H2BF HIST2H2BF 1.635 3.78 343 FLJ27354 FLJ27354 1.594 3.855 6 PLTP PLTP 1.577 13.447 180 CDCA7L CDCA7L 1.576 5.052 48 S100A10 S100A10 1.571 7.426 94 LBH LBH 1.57 6.259 293 MGC24103 MGC24103 1.566 4.316 119 NID2 NID2 1.564 5.792 287 SGK1 SGK1 1.546 4.356 201 OR2A9P OR2A9P 1.538 4.897 363 TXNDC5 TXNDC5 1.53 3.165 281 LOC339524 LOC339524 1.527 4.435 190 SLC7A5P2 SLC7A5P2 1.508 4.996 289 FMOD FMOD 1.503 4.336 318 MATN2 MATN2 1.498 4.074 39 KCTD12 KCTD12 1.492 7.786 351 C1R C1R 1.462 3.647 278 CXCL12 CXCL12 1.454 4.471 323 LOC648570 LOC648570 1.448 4.042 332 STEAP1 STEAP1 1.444 3.966 335 HMOX1 HMOX1 1.44 3.951 250 SNAI2 SNAI2 1.438 4.587 348 NFKBIZ NFKBIZ 1.436 3.724 352 DHRS4 DHRS4 1.432 3.585 327 LY96 LY96 1.417 4.012 336 LOXL1 LOXL1 1.417 3.942 328 IL32 IL32 1.414 4.009 97 GUCY1B3 GUCY1B3 1.413 6.21 38 GPC6 GPC6 1.405 7.837 313 OR2A20P OR2A20P 1.4 4.146 64 PGCP PGCP 1.397 6.848 317 JSRP1 JSRP1 1.387 4.127 338 TUBB6 TUBB6 1.384 3.918 324 ZFPM2 ZFPM2 1.383 4.026 240 GLT8D2 GLT8D2 1.38 4.628 40 HSD17B11 HSD17B11 1.377 7.754 152 PMP22 PMP22 1.375 5.417 118 ECHDC3 ECHDC3 1.368 5.802 222 DFNA5 DFNA5 1.362 4.709 263 ATL1 ATL1 1.362 4.536 349 TOMM7 TOMM7 1.36 3.717 29 GYPC GYPC 1.354 8.632 127 PABPC4L PABPC4L 1.351 5.694 276 KLF10 KLF10 1.346 4.494 295 PRKCDBP PRKCDBP 1.338 4.303 288 BGN BGN 1.327 4.338 80 CNN3 CNN3 1.326 6.538 202 OSGIN1 OSGIN1 1.326 4.885 90 NEK7 NEK7 1.322 6.33 321 JUN JUN 1.32 4.054 155 CNN2 CNN2 1.317 5.332 143 CTSL1 CTSL1 1.316 5.552 285 GPR141 GPR141 1.315 4.39 345 MN1 MN1 1.315 3.815 9 TAGLN TAGLN 1.314 12.763 340 BST1 BST1 1.31 3.912 22 FLJ20021 FLJ20021 1.303 9.949 21 CASP4 CASP4 1.301 10.271 177 GLIPR2 GLIPR2 1.301 5.105 217 FOXO4 FOXO4 1.301 4.767 103 CLEC10A CLEC10A 1.297 6.068 314 MSC MSC 1.292 4.139 43 CSTA CSTA 1.289 7.564 334 SACS SACS 1.289 3.957 16 CTSC CTSC 1.286 11.537 174 C14orf37 C14orf37 1.279 5.114 330 FLJ41309 FLJ41309 1.277 3.999 325 HOXC6 HOXC6 1.276 4.013 107 KLF6 KLF6 1.272 5.988 296 FKBP9 FKBP9 1.271 4.293 55 CDH11 CDH11 1.27 7.166 298 ZNF815 ZNF815 1.269 4.283 312 GLIPR1 GLIPR1 1.267 4.174 62 FAM70B FAM70B 1.263 6.876 272 LGALS2 LGALS2 1.261 4.507 27 ICAM5 ICAM5 1.259 8.771 246 TMEM51 TMEM51 1.259 4.61 19 RGS10 RGS10 1.253 10.426 69 CHST15 CHST15 1.248 6.73 116 GRASP GRASP 1.245 5.818 44 WWC3 WWC3 1.244 7.553 316 COL6A3 COL6A3 1.242 4.135 150 EMP1 EMP1 1.241 5.423 42 CAV2 CAV2 1.238 7.57 194 SORCS2 SORCS2 1.237 4.984 L 171 GAS6 GAS6 1.235 5.156 302 LAPTM5 LAPTM5 1.235 4.24 195 AOAH AOAH 1.229 4.981 307 SCT SCT 1.221 4.194 139 RAB23 RAB23 1.22 5.584 73 GUCY2D GUCY2D 1.218 6.631 162 PDE1B PDE1B 1.215 5.261 207 ROM1 ROM1 1.214 4.861 102 CGREF1 CGREF1 1.211 6.095 243 ROBO3 ROBO3 1.211 4.622 35 CCM2 CCM2 1.21 7.952 79 LOC100505500 LOC100505500 1.21 6.546 59 HSD17B14 HSD17B14 1.206 6.926 290 LRRC3 LRRC3 1.206 4.331 291 C3orf54 C3orf54 1.206 4.326 284 PLS3 PLS3 1.205 4.432 129 CNIH CNIH 1.2 5.668 138 ODZ4 ODZ4 1.199 5.592 283 EEF1A2 EEF1A2 1.198 4.433 114 COL6A2 COL6A2 1.196 5.832 126 SLC7A1 SLC7A1 1.196 5.697 274 PIK3CD PIK3CD 1.196 4.495 188 ADAM8 ADAM8 1.191 4.999 212 ALDH1B1 ALDH1B1 1.191 4.808 63 NAT8B NAT8B 1.188 6.871 179 ADAMTS2 ADAMTS2 1.188 5.064 261 RPL23AP53 RPL23AP53 1.187 4.544 258 ZNF703 ZNF703 1.186 4.559 259 ST3GAL1 ST3GAL1 1.185 4.547 34 GTSE1 GTSE1 1.184 8.471 273 LOC100507054 LOC100507054 1.184 4.502 219 C19orf40 C19orf40 1.182 4.762 4 GPR161 GPR161 1.181 16 31 RECQL RECQL 1.18 8.574 36 FOXC1 FOXC1 1.179 7.909 262 CBS CBS 1.177 4.537 294 ZMYND15 ZMYND15 1.177 4.314 218 ACAP1 ACAP1 1.176 4.765 249 EGFLAM EGFLAM 1.174 4.589 286 RAD51AP2 RAD51AP2 1.174 4.377 238 SHISA4 SHISA4 1.173 4.637 275 SH3BGRL SH3BGRL 1.173 4.495 8 ARPC2 ARPC2 1.172 13.069 244 C21orf30 C21orf30 1.172 4.62 18 POU2F2 POU2F2 1.171 10.542 153 TIMP2 TIMP2 1.171 5.363 208 ZNF503 ZNF503 1.171 4.847 85 TLE3 TLE3 1.168 6.434 185 BEGAIN BEGAIN 1.168 5.023 81 IKBIP IKBIP 1.167 6.532 33 FAM20C FAM20C 1.166 8.548 68 FTL FTL 1.165 6.762 164 GSTO1 GSTO1 1.165 5.209 125 SEMA7A SEMA7A 1.162 5.707 225 EID1 EID1 1.157 4.685 282 SCARF2 SCARF2 1.155 4.434 117 C4orf3 C4orf3 1.154 5.817 158 NAGS NAGS 1.154 5.323 159 MYADM MYADM 1.154 5.304 214 COL1A1 COL1A1 1.154 4.789 266 BCAS4 BCAS4 1.154 4.529 93 LAMP2 LAMP2 1.152 6.26 100 DCN DCN 1.151 6.115 148 SLC43A3 SLC43A3 1.15 5.478 30 NR3C1 NR3C1 1.149 8.614 187 PLD2 PLD2 1.147 5.008 267 WDR53 WDR53 1.147 4.527 24 YBX1 YBX1 1.145 9.76 170 PIP4K2A PIP4K2A 1.145 5.157 67 UNC13D UNC13D 1.144 6.769 221 FOLR2 FOLR2 1.144 4.71 77 PTP4A2 PTP4A2 1.143 6.577 26 TNFRSF10C TNFRSF10C 1.142 8.802 199 FIBCD1 FIBCD1 1.142 4.91 145 TPM2 TPM2 1.14 5.534 130 PSMA7 PSMA7 1.134 5.649 223 PDIA2 PDIA2 1.133 4.703 133 NR2F2 NR2F2 1.132 5.604 271 CMBL CMBL 1.132 4.513 163 MICALCL MICALCL 1.129 5.222 203 IPO5 IPO5 1.129 4.878 206 TNS1 TNS1 1.123 4.864 247 KCTD20 KCTD20 1.123 4.607 265 GNA12 GNA12 1.122 4.529 132 ASB1 ASB1 1.119 5.613 231 ADC ADC 1.119 4.662 198 SPOCK2 SPOCK2 1.116 4.914 52 CDK6 CDK6 1.112 7.313 233 RBFOX2 RBFOX2 1.112 4.656 229 ST3GAL2 ST3GAL2 1.111 4.678 192 COMT COMT 1.108 4.986 235 SEMA6B SEMA6B 1.106 4.653 51 HSP90AA1 HSP90AA1 1.104 7.362 239 FCGR2C FCGR2C 1.104 4.628 224 ALPK2 ALPK2 1.103 4.694 71 MAST4 MAST4 1.102 6.715 110 IGHA1 IGHA1 1.102 5.944 41 COL6A1 COL6A1 1.101 7.678 216 LOC152225 LOC152225 1.101 4.774 210 RABGGTB RABGGTB 1.1 4.834 Pathways List for Example 1: Gene Expression Signatures Associated with Endometriosis

Genes Dysregulated (/Genes in Dysregulated Genes Log10p Name Pathway) (p ≤ 0.005) Magnitude value FDR Chemokine 22/184 ADCY8; ARRB1; BRAF; 13.43 2.77 0.03 signaling pathway CCL3, 3L1, 7; CXCL2, 12; CXCR5; DOCK2; GNG3, 7, 10; JAK3; LYN; NCF1; PIK3CD; PLCB4; PRKACG; RAP1A; ROCK1; SHC3 Focal adhesion 29/200 ACTG1, N3; BCL2; BRAF; 9.4 3.13 0.03 CAV2; COL1A1, 1A2, 5A1, 5A2, 6A1, 6A2, 6A3; EGFR; FN1; ITGB3; JUN; LAMA1; MYL9; PAK6; PARVA; PDGFA; PIK3CD; PPP1R12A; PTEN; RAP1A; ROCK1; SHC3; THBS2; VCL Cytokine-cytokine 24/260 CCL3, 3L1, 7; CD40, 70; 5.13 4.51 0 receptor interaction CSF1R; CXCL2, 12; CXCR5; EGFR; EPOR; IFNA5; IL12RB1, 17B, 20RB; OSM; PDGFA; PRL; TNFRSF10A, 10C, 10D, 13B; TNFSF9, 12 Staphylococcus 11/52  C1QB; C1R; C1S; C3; CFH; 4.36 2.92 0.03 Aureus Infection FCGR2C; HLA-DPB1, -DRB3; ITGB2; KRT10; SELPLG Leukocyte 18/116 ACTG1; ACTN3; CD99; 3.71 2.05 0.11 transendothelial CLDN4, 5; CXCL12; EZR; migration ITGB2; MAPK11; MSN; MYL9; NCF1; NCF2; OCLN; PIK3CD; RAP1A; ROCK1; VCL ECM-receptor 12/84  COL1A1, 1A2, 5A1, 5A2, 3.49 2.89 0.03 interaction 6A1, 6A2, 6A3; FN1; ITGB3; LAMA1; SV2C; THBS2 Systemic lupus 26/124 ACTN3; C1QB; C1R; C1S; 2.65 5.42 0 erythematosus C3; CD40; FCGR2C; H2AFJ; H2AFX; HIST1H2AC, AD, AE, BD, BE, BF, BG, BK; HIST1H3E; HIST1H4A; HIST2H2AB, AC, BE, BF; HLA-DPB1, -DRB3; SSB Osteoclast 21/128 ACP5; AKT2; FHL2; FYN; 1.31 1.95 0.13 Differentiation GAB2; JAK1; JUNB; LILRB1; MAPK8; NFATC1; PIK3CG; PIK3R1, R3; PPP3CA; SOCS3; TEC; TGFB2; TNFRSF1A, 11B Sulfur relay system 3/10 CTU2; NFS1; TST −0.54 1.39 0.42 Pathogenic E. Coli 13/56  ACTG1; ARPC2; EZR; −2.23 2.68 0.03 infection LY96; NCL; OCLN; ROCK1; TUBA3E; TUBB, 2B, 6, 8; YWHAQ

Example 2: Identification of Phase-Specific Genetic Signatures for Endometriosis

The ambiguous knowledge of the mechanisms of endometriosis development complicates its treatment. The accepted mechanism for endometriosis is retrograde menstruation, which is the backflow of menstrual fluid and associated endometrial cells through the fallopian tubes. See FIG. 3. As shown in FIG. 4, retrograde menstruation causes adhesion and proliferation of endometrial cells outside of the uterine. The ectopic endometrial growth causes inflammation, immune system evasions, and EP dependence/P4 resistances. FIG. 4 also sets forth other potential contributory factors of endometriosis, including the presence of Mullerian rests, endometrial stem cells, and metaplasia.

Methods of the invention, according to certain embodiments, rely on genetics and bioinformatics in order to identify clinically significant genetic signatures of endometriosis. The genetic signatures, determined via methods of the invention, can be used to classify a subject's clinical condition (e.g. uterine phase or grade of endometriosis) and can be used to target treatment.

Data Set

A meta-analysis was conducted to combined and correlate phase-specific micro-array data of several different endometrial studies. The following table lists the studies, type of microarray and the number of patients. Incorporating all of the studies, the meta-analysis analyzed data from 106 samples from 61 patients. The data from the study was subject to a meta-analysis as previously described. FIG. 5 illustrates the clinical parameters that were assessed for the micro-array studies: age, presence/absence of endometriosis, stage of the disease, presence of pain, gravidty/parity, endometrioma position, tissue sampling method, phase of the uterine cycle, ethnicity, and leiomyata.

Study Type of Array #Patients Burney, Richard O., et al. “Gene Affymetrix, 21 expression analysis of endometrium HG U133 + reveals progesterone resistance and 2.0 candidate susceptibility genes in women with endometriosis.” Endocrinology 148.8 (2007): 3814- 3826 Crispi, Stefania, et al. “Transcriptional Affymetrix, 8 profiling of endometriosis tissues U133A2.0 identifies genes related to organogenesis defects.” Journal of cellular physiology 228.9 (2013): 1927-1934 Eyster, Kathleen M., et al. “Whole GE/Amersham 11 genome deoxyribonucleic acid CodeLink, microarray analysis of gene expression Human HG in ectopic versus eutopic endometrium.” Fertility and sterility 88.6 (2007): 1505-1533 Hever, Aniko, et al. “Human Affymetrix, 10 endometriosis is associated with HG U133 + plasma cells and overexpression of B 2.0 lymphocyte stimulator.” Proceedings of the National Academy of Sciences 104.30 (2007): 12451-12456 Hull, M. Louise, et al. “Endometrial- Affymetrix, 9 peritoneal interactions during HG U133A endometriotic lesion establishment.” The American journal of pathology 173.3 (2008): 700-715 Talbi, S., et al. “Molecular Affymetrix, 16 phenotyping of human endometrium HG U133 + distinguishes menstrual cycle phases 2.0 and underlying biological processes in normo-ovulatory women.” Endocrinology 147.3 (2006): 1097- 1121.

Results

Based on the meta-analysis, the parameters that dominated gene expression patterns include: 1) the phase of the uterine cycle and 2) the presence/absence of endometriosis. FIG. 6A illustrates gene expression of the eutopic endometrium of the samples across the proliferative, early secretory, mid-secretory, and late secretory phases. FIG. 6B illustrates gene expression of the ectopic endometrium across the proliferative, early secretory, mid-secretory, and late secretory phases. K-means clustering analysis of the micro-array data indicated showed that genetic signatures that ectopic and eutopic endometrial tissue were phased dependent in different ways.

The endometriosis phase-specific expression patterns were compared to the normal phase-specific expression patterns in order to identify genetic expression signatures specific to endometriosis and specific to a certain phase of the uterine cycle. FIG. 7 illustrates the phase-specific genetic signature differences between endometriosis and normal populations. The proliferative phase showed 430 genes with a fold-change greater than 2.0 (P-value less than 0.0005), the early secretory phase showed 151 genes with a fold-change greater than 2.0 (P-value less than 0.0005), and the mid-late secretory phase showed 3 genes with a fold-change greater than 2.0 (P-value less than 0.0005).

As illustrated in FIGS. 8-10, the meta-analysis revealed that certain genes of the endometriosis samples are up-regulated and down-regulated as compared to the normal. In addition, the misregulation of genes of the endometriosis sample varied across the different phases. The phase-specific regulation pattern of genes associated with endometriosis can be used as a regulation signature of the disease. FIG. 8 illustrates up-regulated and de-regulated genes associated with endometriosis at the proliferative stage. FIG. 9 illustrates up-regulated and de-regulated genes associated with endometriosis at the early secretory stage. FIG. 10 illustrates up-regulated genes associated with endometriosis at the mid to late secretory phase.

Discussion

The phase-specific endometriosis signatures identified using methods of the invention can be used as biomarkers for the disease and to guide course of treatment. Additional information can be correlated into the meta-analysis to obtain phase-specific endometriosis signatures associated with particular parameters, for example, age, stage of endometriosis, infertility, and other phenotypic traits. The clinical applications of the phase-specific endometriosis signatures are discussed hereinafter.

In certain embodiments, the phase-specific endometriosis signatures can be utilized to target diagnosis of endometriosis. For example, the expression levels of transcripts in one or more samples obtained from a patient suspected of having endometriosis can be compared to known phase-specific endometriosis signatures. The samples can be obtained at a particular phase or across several time-points of the patient's uterine cycle. The expression levels can be compared to signatures corresponding to one phase or diverse group of signatures from the various phases of the uterine cycle. Similarities between the patient's expression level and the phase-specific endometriosis signatures are the patient's phase-specific endometriosis signature and indicate that the patient has endometriosis. A course of treatment can be chosen that is tailored to the patient's phase-specific endometriosis signatures. For example, drugs may be recommended or prescribed to the patient to coincide with the phase in which the patient has endometriosis signatures. In addition, drugs may be recommended or prescribed to the patients that are known to target the gene or the biochemical pathways associated with the gene. In instances where the phase-specific endometriosis signatures is also keyed to a particular grade (i.e. severity) of endometriosis, the comparison between the patient's expression pattern and the endometriosis signatures may be indicative of the grade of the patient's endometriosis.

The phase-specific genetic signatures can also be applied to identify and chart the patient's specific uterine cycle. The uterine cycle is very individual specific-ranging between 21 days to 35 days, with the norm being 28 days. In addition, the length of the phases of the uterine cycle likewise varies among individuals. Since treatment of endometriosis may be implicated for certain phases, the ability to genetically confirm the phase of an individual to direct the timing of treatment is advantageous. According to methods of the invention, the patient's expression levels across different time-points can be compared phase-specific endometriosis signatures to determine the timing of the patient's uterine cycle. For instances, correlations between the patient's expression levels and signatures of a particular phase are indicative of the phase of the patient. Utilizing genetics to determine the timing of a patient's uterine cycle provides benefits such as being able to tailor treatment of a variety of reproductive conditions, including the treatment of infertility, premenstrual dysphoric disorder, and endometriosis. In various scenarios, a better understanding of the timing of one's uterine cycle provides greater insight into the hormonal state of the patient, which may guide hormone treatment regimens.

Example 3

Though an effect of obesity on IVF success rate seems likely, there is disagreement about the precise nature of the relationship between these two parameters. Articles differ in their approach to address this question: Many of them focus on patients with specific infertility diagnoses, while others have no inclusion criteria. To determine how obesity affects IVF success rates for patients, and to determine whether this relationship differs between patients with different infertility diagnoses, relationships between obesity and an increased risk of IVF treatment failure were investigated among women with different infertility diagnoses. A retrospective analysis was employed using de-identified fresh and cryo-thawed self IVF cycles (N=5208, 2738 patients) from a large reproductive medical center.

Methods:

A Reproductive Medicine Associates of New York, LLP dataset of 5208 cycles was used for the analysis. Logistic regression models were created and controlled for age, day 3 follicle stimulating hormone (FSH), peak estradiol level, number of oocytes retrieved, number of embryos transferred, and whether intra-cytoplasmic sperm injection (ICSI) procedure was performed).

The infertility diagnoses included in the analysis were diminished ovarian reserve, endometriosis, idiopathic, male factor, PCOS, and tubal factor (Table 1).

TABLE 1 Sample information for diagnoses included cy- obese obese % obese p- diagnosis cles cycles patients patients patients value Overall 5208 648 2738 344 0.124 DOR 1065 94 615 61 0.088 0.97 Endometriosis 327 13 170 9 0.04 0.99 Idiopathic 1347 140 705 66 0.104 1 Male Factor 1492 218 742 105 0.146 0.1 PCOS 439 93 223 47 0.212 0.0005 Tubal 538 90 283 56 0.167 0.0006 Factor (NB: p-value is obtained from a proportion-test, which checks if the proportion of obese patients for a given diagnosis is greater than in the data combining all six diagnoses.)

Results:

Both clinical pregnancy and live birth outcome were correlated with obesity across all patients, defining obesity as BMI 30 kg/m2 and non-obese as BMI<30 kg/m2. For data combining patients of all diagnoses, there was no correlation between obesity on clinical pregnancy [Table 2] or live birth outcome [Table 3].

The analysis of repeated, this time breaking the cohort down by diagnosis, comparing clinical pregnancy and live birth outcome rates, in relation to obesity. PCOS was found to be the only diagnosis in which a relationship between obesity and clinical pregnancy (OR=0.57, p=0.03) [Table 2], and live birth outcome (OR=0.44, p=0.02) exists [Table 3].

TABLE 2 Change in likelihood of Clinical Pregnancy outcome given presence of patient obesity. Likelihood of Clinical Pregnancy if patient is obese Diagnosis Odds Ratio P-value All Diagnoses 1.08 0.43 PCOS 0.57 0.03 Male Factor 1.03 0.86 DOR 1.32 0.32 Tubal Factor 1.38 0.19 Endometriosis 1.07 0.92 Idiopathic 1.01 0.96

TABLE 3 Change in likelihood of Live Birth outcome given presence of patient obesity. Likelihood of Live Birth if patient is obese Diagnosis Odds Ratio P-value All Diagnoses 0.95 0.71 PCOS 0.44 0.02 Male Factor 1.33 0.27 DOR 0.94 0.88 Tubal Factor 2.35 0.1 Endometriosis 0.74 0.72 Idiopathic 0.71 0.22

As a secondary analysis, a specific point along an IVF cycle was determined where the effects of obesity become significant. To do this, ‘landmarks’ such as number of oocytes retrieved, rates of embryo development, number of embryos transferred and implantation rate were correlated with obesity. This analysis was repeated using data subset for different common infertility diagnoses, to determine what parts of the cycle are most affected by obesity, for each diagnosis.

Since obesity was found to have an effect on all outcomes post ET in the PCOS population, further analysis was used to pinpoint where the effect manifested. To achieve that, implantation rate less than 50% (in addition to the standard confounding variables) was controlled for in the LB outcome analysis.

Obesity was not correlated significantly with any IVF cycle ‘landmarks’ between oocyte retrieval and embryo transfer, for any diagnosis. This result indicates that the effect of obesity on IVF outcome occurs after embryo transfer takes place.

Implantation rate was significantly adversely correlated with presence of obesity for PCOS patients, but not for other diagnoses. In investigating whether implantation rate less than 50% was correlated with obesity, it was determined that, for PCOS patients, Implantation Rate<50% was almost twice as likely if the patient was obese (OR=1.82, p=0.02) [Table 4]. This result supports the hypothesis that the influence of obesity on IVF success for PCOS patients occurs after embryo transfer.

TABLE 4 Likelihood of implantation rate less than 50%, given presence of patient obesity. Likelihood of Implantation Rate < 50% if patient is obese Diagnosis Odds Ratio P-value All Diagnoses 0.95 0.61 PCOS 1.82 0.02 Male Factor 0.85 0.33 DOR 0.76 0.42 Tubal Factor 0.63 0.06 Endometriosis 0.84 0.78 Idiopathic 1.30 0.77

Having found that obesity is correlated with reduced implantation rate for PCOS patients, it was then investigated whether the effect on live birth outcome occurred independently of its reduction of implantation rate, or if a reduced implantation rate was the source of the negative effect on Live Birth.

Analysis indicated that obesity's negative impact on implantation rate is the source of its negative effect on live birth, and not merely an independent effect.

TABLE 5 Effects of both ‘implantation rate less than 50%’ and obesity on live birth outcome for PCOS patients. Effect on live birth outcome (PCOS patients) obesity OR = 0.53, p = 0.2 implantation rate OR = 0.2, p < 10{circumflex over ( )}−9 less than 50%

Detailed Data for Example 3: 1) Clinical Outcomes

Retrieved~obese + SrgFollicleslessthanEq14 + Age + FSHMax + PeakE2 effect p-value n n diagnosis (obese) (obese) lwr upr (cycles) (obese) male 1.09 0.02 1.02 1.16 — factor PCOS 0.9 0.07 0.81 1.01 — Idiopathic 1.07 0.14 0.98 1.16 — Tubal 1 0.99 0.91 1.1 — Factor DOR 1.02 0.7 0.9 1.16 — endo- 0.97 0.86 0.71 1.34 — metriosis overall 1.04 0.07 1 1.08 4513 567 family = poisson data includes Retrieved = 0 data includes EmbryosTransferred = 0 data excludes frozen cycles effect p-value n n diagnosis (obese) (obese) lwr upr (cycles) (obese) CountViable~obese + Retrieved + Age + FSHMax + PeakE2 + icsi + cryo male 1 0.99 0.91 1.1 1492 218 factor PCOS 0.9 0.16 0.78 1.04 439 93 Idiopathic 0.97 0.67 0.87 1.09 1347 140 Tubal 1.13 0.06 1 1.29 538 90 Factor DOR 0.91 0.21 0.78 1.06 1065 94 endo- 0.93 0.73 0.62 1.4 327 13 metriosis overall 1 0.92 0.95 1.06 5208 648 EmbryosTransferred~obese + Age + FSHMax + PeakE2 + icsi + cryo male 1.1 0.02 1.02 1.19 1492 218 factor PCOS 0.99 0.82 0.88 1.11 439 93 Idiopathic 0.97 0.67 0.87 1.09 1347 140 Tubal 0.98 0.72 0.87 1.1 538 90 Factor DOR 0.88 0.12 0.75 1.03 1065 94 endo- 0.77 0.09 0.57 1.04 327 13 metriosis overall 1.03 0.28 0.98 1.08 5208 648 family = poisson data includes Retrieved = 0 data includes EmbryosTransferred = 0 OR p-value n n diagnosis (obese) (obese) lwr upr (cycles) (obese) ImplantationRatelessthan50~obese + Retrieved + EmbryosTransferred + cryo + Age + FSHMax + PeakE2 + icsi male 0.85 0.33 0.61 1.18 1492 218 factor PCOS 1.81 0.02 1.11 2.96 439 93 Idiopathic 1.29 0.24 0.84 1.98 1347 140 Tubal 0.63 0.06 0.38 1.02 538 90 Factor DOR 0.76 0.42 0.39 1.47 1065 94 endo- 0.84 0.78 0.24 2.96 327 13 metriosis overall 0.84 0.78 0.24 2.96 5208 648 ClinPregOutcome~obese + Retrieved + EmbryosTransferred + cryo + Age + FSHMax + PeakE2 + icsi male 1.03 0.86 0.74 1.45 1492 218 factor PCOS 0.57 0.03 0.34 0.94 439 93 Idiopathic 1.01 0.96 0.69 1.48 1347 140 Tubal 1.38 0.19 0.85 2.25 538 90 Factor DOR 1.32 0.32 0.76 2.3 1065 94 endo- 1.07 0.92 0.28 4.17 327 13 metriosis overall 1.08 0.43 0.89 1.3 5208 648 LiveBirthOutcome~obese + Retrieved + EmbryosTransferred + cryo + Age + FSHMax + PeakE2 + icsi male 1.33 0.27 0.81 2.19 1492 218 factor PCOS 0.44 0.02 0.21 0.9 439 93 Idiopathic 0.71 0.22 0.41 1.22 1347 140 Tubal 2.35 0.1 0.84 6.56 538 90 Factor DOR 0.94 0.88 0.43 2.05 1065 94 endo- 0.74 0.72 0.14 3.93 327 13 metriosis overall 0.95 0.71 0.72 1.25 5208 648 family = binomial data includes Retrieved = 0 data includes EmbryosTransferred = 0 AnyEmbryosTransferred~obese + Age + FSHMax + PeakE2 + icsi + cryo effect p-value n n diagnosis (obese) (obese) lwr upr (cycles) (obese) male 1.65 0.12 0.88 3.08 1492 218 factor PCOS 0.57 0.17 0.26 1.26 439 93 Idiopathic 0.96 0.88 0.55 1.69 1347 140 Tubal 1.15 0.77 0.44 2.99 538 90 Factor DOR 0.84 0.55 0.48 1.47 1065 94 endo- 0.48 0.39 0.09 2.54 327 13 metriosis overall 1.03 0.28 0.98 1.08 5208 648 family = binomial data includes Retrieved = 0 data includes EmbryosTransferred = 0 LB~obese + . . . + controlling for ImplantationRatelessthan50 LiveBirthOutcome~obese + Retrieved + EmbryosTransferred + cryo + Age + FSHMax + PeakE2 + icsi + ImplantationRatelessthan50 OR p-value OR p-value n n diagnosis (obese) (obese) lwr upr (imprateLT50) (imprateLT50) lwr upr (cycles) (obese) male 1.245 0.421 0.730 2.126 0.073 0.000 0.047 0.115 1492.000 218.000 factor PCOS 0.534 0.197 0.206 1.384 0.033 0.000 0.015 0.073 439.000 93.000 Idiopathic 0.964 0.914 0.497 1.870 0.051 0.000 0.030 0.087 1347.000 140.000 Tubal 1.914 0.237 0.652 5.616 0.081 0.000 0.034 0.195 538.000 90.000 Factor DOR 0.708 0.475 0.275 1.824 0.109 0.000 0.058 0.205 1065.000 94.000 endo- 0.406 0.194 0.104 1.581 0.081 0.000 0.035 0.190 327.000 13.000 metriosis overall 0.946 0.728 0.693 1.292 0.073 0.000 0.057 0.093 5208.000 648.000 family = binomial data includes Retrieved = 0 data includes EmbryosTransferred = 0 LiveBirthOutcome~obese + Retrieved + EmbryosTransferred + cryo + Age + FSHMax + PeakE2 + icsi + ImplantationRatelessthan50 OR p OR p diagnosis (obese) (obese) (imprate < 50) (imprate < 50) male 1.25 0.42 0.07 <10{circumflex over ( )}−9 factor PCOS 0.53 0.2 0.03 <10{circumflex over ( )}−9 Idiopathic 0.96 0.91 0.05 <10{circumflex over ( )}−9 Tubal 1.91 0.24 0.08 <10{circumflex over ( )}−7 Factor DOR 0.71 0.48 0.11 <10{circumflex over ( )}−9 endo- 0.41 0.19 0.08 <10{circumflex over ( )}−8 metriosis overall 0.95 0.73 0.07 <10{circumflex over ( )}−9

2) Oocyte/Embryo Development Outcomes (Conditional on Retrieval)

effect p-value n n diagnosis (obese) (obese) lwr upr (cycles) (obese) Countof2PN~obese + Retrieved + Age + FSHMax + PeakE2 + icsi male 0.98 0.53 0.91 1.05 1255 184 factor PCOS 0.93 0.19 0.84 1.04 332 77 Idiopathic 1.02 0.6 0.94 1.11 1176 123 Tubal 1.11 0.01 1.03 1.21 456 81 Factor DOR 1 0.96 0.86 1.15 1016 89 endo- 1.03 0.82 0.82 1.28 274 11 metriosis overall 1.01 0.5 0.97 1.06 4509 565 CountofDeg~obese + Retrieved + Age + FSHMax + PeakE2 + icsi male 1.09 0.44 0.88 1.35 1255 184 factor PCOS 1.49 0.08 0.96 2.32 332 77 Idiopathic 1.32 0.07 0.98 1.78 1176 123 Tubal 1.21 0.28 0.85 1.72 456 81 Factor DOR 0.89 0.61 0.58 1.37 1016 89 endo- 1.53 0.24 0.75 3.11 274 11 metriosis overall 1.16 0.04 1.01 1.33 4509 565 CountAbnormal~obese + Retrieved + Age + FSHMax + PeakE2 + icsi male 1.05 0.7 0.81 1.37 1255 184 factor PCOS 1.07 0.67 0.78 1.48 332 77 Idiopathic 1 0.97 0.8 1.23 1176 123 Tubal 0.96 0.79 0.73 1.27 456 81 Factor DOR 1.08 0.61 0.8 1.47 1016 89 endo- 0.78 0.5 0.38 1.6 274 11 metriosis overall 1.03 0.6 0.92 1.16 4509 565 family = poisson data does not include Retrieved = 0 data includes EmbryosTransferred = 0

3) Oocyte/Embryo Development Outcomes (Conditional on Retrieval, Grouped on Retrieved)

effect p-value n n diagnosis (obese) (obese) lwr upr (cycles) (obese) binary_Countof2PN~obese + Age + FSHMax + PeakE2 + Retrieved + icsi male 1.03 0.69 0.89 1.19 1255 184 factor PCOS 0.89 0.38 0.69 1.15 332 77 Idiopathic 0.99 0.95 0.84 1.17 1176 123 Tubal 1.38 0 1.14 1.68 456 81 Factor DOR 0.97 0.84 0.7 1.33 1016 89 endo- 1.22 0.44 0.73 2.03 274 11 metriosis binary_CountofDeg~obese + Age + FSHMax + PeakE2 + Retrieved + icsi male 1.18 0.19 0.92 1.52 1255 184 factor PCOS 1.73 0.04 1.02 2.96 332 77 Idiopathic 1.13 0.54 0.76 1.68 1176 123 Tubal 1.08 0.72 0.7 1.67 456 81 Factor DOR 0.88 0.63 0.54 1.45 1016 89 endo- 1.54 0.29 0.69 3.43 274 11 metriosis binary_CountofAbnormal~obese + Age + FSHMax + PeakE2 + Retrieved + icsi male 1.03 0.83 0.76 1.4 1255 184 factor PCOS 1.18 0.39 0.81 1.71 332 77 Idiopathic 0.97 0.81 0.74 1.26 1176 123 Tubal 0.97 0.87 0.71 1.33 456 81 Factor DOR 1.16 0.43 0.8 1.68 1016 89 endo- 0.93 0.8 0.52 1.66 274 11 metriosis family = binomial data does not include Retrieved = 0 data includes EmbryosTransferred = 0

4) Oocyte/Embryo Development Outcomes (Conditional on Retrieval)

effect p-value n n diagnosis (obese) (obese) lwr upr (cycles) (obese) Countof2PN~obese + Retrieved + Age + FSHMax + PeakE2 + icsi + CountofM2 male 0.97 0.49 0.9 1.05 1255 184 factor PCOS 0.94 0.2 0.85 1.04 332 77 Idiopathic 1.03 0.52 0.95 1.12 1176 123 Tubal 1.11 0.01 1.02 1.21 456 81 Factor DOR 0.99 0.94 0.86 1.14 1016 89 endo- 1.01 0.91 0.81 1.26 274 11 metriosis overall 1.01 0.59 0.97 1.05 4509 565 CountofDeg~obese + Retrieved + Age + FSHMax + PeakE2 + icsi + CountofM2 male 1.09 0.44 0.88 1.35 1255 184 factor PCOS 1.49 0.07 0.97 2.28 332 77 Idiopathic 1.31 0.08 0.97 1.77 1176 123 Tubal 1.29 0.12 0.94 1.78 456 81 Factor DOR 0.91 0.67 0.6 1.39 1016 89 endo- 1.55 0.24 0.75 3.18 274 11 metriosis overall 1.16 0.03 1.01 1.34 4509 565 CountAbnormal~obese + Retrieved + Age + FSHMax + PeakE2 + icsi + CountofM2 male 1.05 0.69 0.81 1.37 1255 184 factor PCOS 1.06 0.71 0.77 1.47 332 77 Idiopathic 1 0.99 0.81 1.24 1176 123 Tubal 0.95 0.74 0.72 1.26 456 81 Factor DOR 1.08 0.61 0.8 1.47 1016 89 endo- 0.68 0.3 0.33 1.41 274 11 metriosis overall 1.03 0.64 0.91 1.16 4509 565 family = poisson data does not include Retrieved = 0 data includes EmbryosTransferred = 0 5) Oocyte/Embryo Development Outcomes (Conditional on Retrieval, Grouped on MII)

Example 4

High aneuploidy rates are often associated with poor oocyte and embryo quality, both of which decrease with age. As with aneuploidy, FSH levels also rise with age; however, no direct link has been demonstrated between FSH levels and aneuploidy. A large cohort of retrospective pre-implantation genetic screening (PGS) data was studied to clarify the respective contributions of FSH and age to aneuploidy.

Patients analyzed included those with partners of normal karyotype, who underwent fresh in vitro fertilization (IVF) cycles in which 1 oocyte was retrieved, PGS was performed, and day 3 FSH levels were known for the cycle. The effects of patients' age and FSH levels (assessed both as a continuous variable and above/below a threshold of 13 mUI/mL) were correlated with aneuploidy status using generalized estimation equation (GEE) models.

A total of 462 patients with 2207 embryos were analyzed. Overall, patients with normal ploidy were younger (35.5±4.0 vs.38.1±4.4) and had a lower basal FSH level (7.56±3.6 vs. 8.1±3.5) compared to those with aneuploidy. The odds of aneuploidy increased by 10% for each year of a woman's reproductive lifespan (OR=1.1, p<0.0001). No independent contribution of FSH levels to odds of aneuploidy was found when assessed as a continuous variable (p=0.75) or when considered above a threshold of 13 (p=0.45). However, it was observed that for women with FSH levels above 13 mUI/mL, their odds of aneuploidy increased at a substantially higher rate (50%) for each additional year (OR=1.52, p<0.0001) of life.

The findings suggest that equivalent FSH levels should not be directly equated with egg quality in women of different age. This has significant implications for the management of infertility in younger women with elevated FSH levels. Also, these women might benefit from earlier treatment intervention and egg/embryo banking, given that their odds of aneuploidy rise more rapidly over time than women of the same age without elevated FSH levels. 

1.-9. (canceled)
 10. A method for assessing endometriosis, the method comprising: a. sequencing nucleic acid obtained from a blood or tissue sample derived from a subject to determine a level of a transcript of one or more endometrial-associated gene(s) present in the blood or tissue sample, b. analyzing said levels of transcripts against a reference regulation pattern specific to a time-point in a uterine cycle using a computer system having a processor, thereby to generate a patient-specific signature, the reference regulation pattern being generated using gene expression data obtained from the one or more gene(s) across each phase of the uterine cycle from a reference population that includes both normal patients and endometriosis patients; and c. scoring an endometriosis status of the subject based upon said patient-specific signature.
 11. The method of claim 10, wherein the one or more endometrial-associated gene(s) comprise CCL3L1, CCL3, FAM180A, THBS2, PDGFRL, FN1, CLE11A, CCNA2, KIF20A, BUB1B, HSD17B6, HSD11B1, C7, C3, CXCL2, CXCL12, CXCL13, PDGFC, CXCL14, ACTA2, TAGLN, ROBO3, MT1M, or SORBS1.
 12. The method of claim 10, wherein the one or more endometrial-associated gene(s) comprise CCNA2, KIF20A, BUB1B, CXCL13, ACTA2, TAGLN, MT1M, SORBS1.
 13. The method of claim 10, wherein the one or more endometrial-associated gene(s) comprise SORBS1, TAGLN, or ACTA2.
 14. The method of claim 10, wherein the blood or tissue sample endometrial tissue.
 15. The method of claim 10, wherein the blood or tissue sample is a blood sample.
 16. The method of claim 10, wherein the time-point comprises a phase in the uterine cycle.
 17. A method of treating infertility in a woman, the method comprising: subjecting the woman to an in vitro fertility (IVF) treatment, the in vitro fertility (IVF) treatment implanting an embryo into the woman, the woman having a score of likelihood of implantation, clinical pregnancy, and/or live birth outcomes in potential IVF treatment, the score having been generated by: i. sequencing nucleic acid obtained from a blood or tissue sample derived from the subject to determine a level of a transcript of one or more infertility-associated gene(s) present in the blood or tissue sample obtained from the subject, ii. analyzing said levels of transcripts against a reference regulation pattern specific to a time-point in a uterine cycle using a computer system having a processor, thereby to generate a patient-specific signature, the reference regulation pattern being generated using gene expression data obtained from the one or more infertility-associated gene(s) across each phase of the uterine cycle from a reference population that includes both normal patients and infertile patients; and iii. scoring the likelihood of implantation, clinical pregnancy, or live birth outcomes in potential IVF treatment of the subject based upon said patient-specific signature.
 18. The method of claim 17, wherein the one or more infertility-associated gene(s) comprise CCL3L1, CCL3, FAM180A, THBS2, PDGFRL, FN1, CLE11A, CCNA2, KIF20A, BUB1B, HSD17B6, HSD11B1, C7, C3, CXCL2, CXCL12, CXCL13, PDGFC, CXCL14, ACTA2, TAGLN, ROBO3, MT1M, or SORBS1.
 19. The method of claim 17, wherein the one or more infertility-associated gene(s) comprise CCNA2, KIF20A, BUB1B, CXCL13, ACTA2, TAGLN, MT1M, SORBS1.
 20. The method of claim 17, wherein the one or more infertility-associated gene(s) comprise SORBS1, TAGLN, or ACTA2.
 21. The method of claim 17, wherein the blood or tissue sample is a tissue sample.
 22. The method of claim 21, wherein the tissue sample comprises endometrial tissue.
 23. The method of claim 22, wherein the endometrial tissue is ectopic, eutopic, or both.
 24. The method of claim 17, wherein the blood or tissue sample is a blood sample.
 25. The method of claim 17, wherein the reference regulation pattern is specific to ectopic tissue, eutopic tissue, or both.
 26. The method of claim 17, wherein the time-point comprises a phase in the uterine cycle.
 27. The method of claim 26, wherein the phase is selected from the group consisting of the menstruation phase, the proliferative phase, the early secretory phase, the mid-secretory phase, and the late secretory phase.
 28. The method of claim 17, wherein the reference regulation pattern comprises one or more transcripts selected from the group consisting of de-regulated transcripts, up-regulated transcripts, and combinations thereof.
 29. The method of claim 17, wherein the analyzing step comprises determining the subject's phase in the uterine cycle. 